Abstract. We investigate the consequences of a measure preserving action of 
a finitely generated free group having finite f-invariant entropy. We find that 
for such actions the stabilizer of almost every point must be either trivial or of 
finite index, and the invariant measure must be a linear combination of only 
countably many ergodic measures. If the action is furthermore ergodic, then 
there is an integer n such that for every non-trivial normal subgroup K the 
number of K-ergodic components is at most n. These results are derived from 
a new formula for f-invariant entropy. 



1. Introduction 

Recently Lewis Bowen [4] defined a numerical measure conjugacy invariant for 
actions of finitely generated free groups, called f-invariant entropy. The f-invariant 
entropy is relatively easy to calculate, has strong similarities with the classical 
Kolmogorov-Sinai entropy of actions of amenable groups, and in fact agrees with 
the classical Kolmogorov-Sinai entropy when the finitely generated free group is 
simply Z. Moreover, f-invariant entropy is essentially a special, simpler case of the 
recently emerging entropy theory of sofic group actions being developed by Bowen 
(0, 0, 0), Kerr-Li ([15], [TB], [17]), Kerr ([TJ]), and others ([IT], [2J, [25]). The 
classical Kolmogorov-Sinai entropy has unquestionably been a fundamental and 
powerful tool in the study of actions of amenable groups, and f-invariant entropy 
seems poised to take a similar role in the study of actions of finitely generated free 
groups. There is therefore a significant need to develop and understand the theory 
of f-invariant entropy. This paper serves as a piece of this large program. We study 
how f-invariant entropy, or more specifically the property of having finite f-invariant 
entropy, relates to the qualitative dynamical properties of the action. 

Let us define f-invariant entropy. Let $G$ be a finitely generated free group, let $S$ 
be a free generating set for $G$, and let $G$ act on a standard probability space $(X, \mu)$ 
by measure preserving bijections. If $\alpha$ is a countable measurable partition of $X$ and 
$F \subseteq G$ is finite, then we define 

\[ F \cdot \alpha = \bigvee_{f \in F} f \cdot \alpha. \]

Recall that the Shannon entropy of a countable measurable partition $\alpha$ of $X$ is 

\[ \mathrm{H}(\alpha) = \sum_{A \in \alpha} -\mu(A) \cdot \log(\mu(A)). \]

Also recall that $\alpha$ is generating if the smallest $G$-invariant $\sigma$-algebra containing $\alpha$ 
contains all measurable sets up to sets of measure zero. If there exists a generating 
partition $\alpha$ having finite Shannon entropy, then the f-invariant entropy of this action 
is defined to be 

\[ f_G(X, \mu) = \lim_{n \to \infty} F_G(X, \mu, S, B_n \cdot \alpha), \]

where 

\[ F_G(X, \mu, S, \beta) = (1 - 2r) \cdot \mathrm{H}(\beta) + \sum_{s \in S} \mathrm{H}(s \cdot \beta \vee \beta), \]

$r = |S|$ is the rank of $G$, and $B_n$ is the ball of radius $n$ centered on $1_G$ with 
respect to the generating set $S$. Surprisingly, Bowen proved in [3] and [5] that 
the above limit always exists (the terms in the limit are non-increasing) and the 
value $f_G(X, \mu)$ neither depends on the choice of free generating set $S$ nor on the 
choice of finite Shannon entropy generating partition $\alpha$. If there is no generating 
partition for this action having finite Shannon entropy, then the f-invariant entropy 
is undefined. 
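
As a rough numerical illustration of the single term (1 - 2r) H(beta) + sum over s in S of H(s.beta v beta) from the definition above, the following sketch evaluates it in two extreme situations. The rank, the probability vector, and the two situations (independent translates, and an invariant partition) are assumptions chosen for illustration; the helper names `shannon_entropy` and `f_term` are hypothetical.

```python
import math

def shannon_entropy(probs):
    """Shannon entropy: the sum of -p * log(p) over cells of positive measure."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def f_term(rank, h_beta, join_entropies):
    """(1 - 2r) * H(beta) + sum over s in S of H(s.beta v beta).

    join_entropies lists H(s.beta v beta), one value per free generator s.
    """
    if len(join_entropies) != rank:
        raise ValueError("expected one join entropy per free generator")
    return (1 - 2 * rank) * h_beta + sum(join_entropies)

rank = 2  # assumed rank r
h = shannon_entropy([0.5, 0.25, 0.25])  # assumed distribution of beta

# Assumed extreme 1: beta and s.beta are independent, so H(s.beta v beta) = 2 H(beta).
independent = f_term(rank, h, [2 * h] * rank)

# Assumed extreme 2: s.beta = beta for every s, so H(s.beta v beta) = H(beta).
invariant = f_term(rank, h, [h] * rank)
```

In the independent case the term equals H(beta), and in the invariant case it equals (1 - r) H(beta), which is negative once r > 1.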

Unlike Kolmogorov-Sinai entropy, f-invariant entropy may be negative. In fact 
for some actions $f_G(X, \mu) = -\infty$. However, one always has $f_G(X, \mu) \leq \mathrm{H}(\alpha) < \infty$ 
([4]). Thus the conditions that $f_G(X, \mu)$ is finite and $f_G(X, \mu) \neq -\infty$ are equivalent. 

The goal of this paper is to expand our knowledge on how f-invariant entropy 
is related to qualitative dynamical properties. There have previously only been a 
few results of this type. Bowen proved that Bernoulli shifts over finitely generated 
free groups are classified up to measure conjugacy by their f-invariant entropy, 
when it is defined [4], and he proved that actions with negative f-invariant entropy 
cannot be factors of Bernoulli shifts [6]. In [10, Proof of Lemma 3.5] Bowen and 
Gutman showed that for any action $G \curvearrowright (X, \mu)$ on an atomless probability space 
with defined and finite f-invariant entropy, there must be a cyclic subgroup of $G$ for 
which the induced action of this subgroup has infinite Kolmogorov-Sinai entropy. 
Finally, in [20, Corollary 1.2] the author showed that if $G \curvearrowright (X, \mu)$ has defined and 
positive f-invariant entropy and $K \lhd G$ is any infinite index normal subgroup with 
$G/K$ residually finite, then the restricted action $K \curvearrowright (X, \mu)$ does not admit any 
generating partition having finite Shannon entropy. 

Our results will assume very little of the action. We will only assume that the f- 
invariant entropy be defined and be finite, and sometimes we will assume ergodicity. 
Our main theorem is below. Before stating this theorem we remind the reader that 
a point $y$ in a probability space $(Y, \nu)$ is an atom if $\nu(\{y\}) > 0$. 

Theorem 1.1. Let $G$ be a finitely generated non-cyclic free group acting on a 
standard probability space $(X, \mu)$ by measure preserving bijections. Assume that 
$f_G(X, \mu)$ is defined. If $f_G(X, \mu) \neq -\infty$ and $(Y, \nu)$ is any factor of $(X, \mu)$, then for 
$\nu$-almost every $y \in Y$, the stabilizer of $y$ is either trivial or has finite index in $G$. 
Furthermore, $\nu$-almost every $y \in Y$ with non-trivial stabilizer is an atom, and thus 
there are essentially only countably many points with non-trivial stabilizer. 

We mention that there are examples of actions of finitely generated free groups 
with defined and finite f-invariant entropy which admit factors whose f-invariant 
entropy is not defined (see [3] and [17]). 

The above theorem says that, ignoring atoms, all actions with finite f-invariant 
entropy and all of their factors are essentially free. Since every Bernoulli shift over a 
finitely generated free group is a factor of a Bernoulli shift with defined and finite f- 
invariant entropy ([3]), this theorem implies that all non-trivial factors of Bernoulli 
shifts over finitely generated free groups are essentially free. This fact also follows 
from a recent result of Robin Tucker-Drob [22] which completely characterizes those 
groups $G$ for which every non-trivial factor of a Bernoulli shift over $G$ is essentially 
free. Ornstein [19] proved that factors of Bernoulli shifts over $\mathbb{Z}$ are again Bernoulli 
shifts, but it is not known if factors of Bernoulli shifts over free groups are Bernoulli 
shifts. 

For some groups, the stabilizers which can appear in probability measure pre- 
serving actions are quite restricted (such as for higher rank semi-simple groups, 
by a well known result of Stuck and Zimmer [21]). However, a recent paper by 
Bowen [9, Theorem 3.1 and Remark 3] shows that probability measure preserving 
actions of free groups have a "zoo" of possible stabilizers. Thus the above theorem 
demonstrates a significant restriction imposed by having finite f-invariant entropy. 

We derive the following corollary which exhibits a remarkable restriction on the 
ergodic decompositions of induced actions of normal subgroups. 

Corollary 1.2. Let $G$ be a finitely generated non-cyclic free group acting on a 
standard probability space $(X, \mu)$ by measure preserving bijections. Assume that 
$f_G(X, \mu)$ is defined. If $G \curvearrowright (X, \mu)$ is ergodic and $f_G(X, \mu) \neq -\infty$, then there is 
$n \in \mathbb{N}$ such that for every non-trivial normal subgroup $K \lhd G$ the number of ergodic 
components of $K \curvearrowright (X, \mu)$ is at most $n$. 

In particular, if $\Gamma \leq G$ contains a non-trivial normal subgroup of $G$ then the 
number of ergodic components of $\Gamma \curvearrowright (X, \mu)$ is at most $n$. Notice that the subgroup 
$\Gamma$ is not required to have finite index in $G$. After proving the above corollary in 
Section 5 we will present a construction due to Lewis Bowen which demonstrates 
that the above corollary cannot in general be extended to hold for all non-trivial 
subgroups of $G$ (see Proposition 5.6). 

We prove our main theorem, Theorem 1.1, by studying the ergodic components 
of the action. This requires us to understand how f-invariant entropy behaves with 
respect to ergodic decompositions. We obtain the following. 

Theorem 1.3. Let $G$ be a finitely generated free group of rank $r$ acting on a 
standard probability space $(X, \mu)$ by measure preserving bijections. Assume that 
$f_G(X, \mu)$ is defined. If $\tau$ is the ergodic decomposition of $\mu$ then $f_G(X, \nu)$ is defined 
for $\tau$-almost every ergodic measure $\nu$ and 

\[ f_G(X, \mu) = \int f_G(X, \nu) \, d\tau - (r - 1) \mathrm{H}(\tau). \]


From this we obtain another consequence of having finite f-invariant entropy. 

Corollary 1.4. Let $G$ be a finitely generated non-cyclic free group acting on a 
standard probability space $(X, \mu)$ by measure preserving bijections. Assume that 
$f_G(X, \mu)$ is defined. If $f_G(X, \mu) \neq -\infty$ then $\mu$ is the linear combination of countably 
many ergodic measures. 

The proof of the main theorem relies on a new formula for f-invariant entropy. 
Let us briefly describe this new formula. Fix a free generating set $S$ for $G$ and 
let $|g|$ denote the reduced $S$-word length of $g \in G$. We define a well ordering $\preceq$ 
on $G$ as follows. If $|g| < |h|$ then we declare $g \preceq h$. If $|g| = |h|$ then we use a 
fixed total ordering of $S \cup S^{-1}$ and declare $g \preceq h$ if and only if the reduced $S$-word 
representation of $g$ lexicographically precedes the reduced $S$-word representation of 
$h$. For $g \in G$ we let $P(g)$ be the set of all group elements which strictly precede $g$. 
Let $G$ act by measure preserving bijections on a standard probability space $(X, \mu)$. 
Assume that there is a generating partition $\alpha$ having finite Shannon entropy. Fix 
$1_G \neq g \in G$. Let $s \in S \cup S^{-1}$ be such that $|s^{-1} g| = |g| - 1$. The independence 
decay at $g$ relative to $(S, \alpha)$ is 

\[ \delta(g, S, \alpha) := \mathrm{H}(s^{-1} g \cdot \alpha / P(s^{-1} g) \cdot \alpha) - \mathrm{H}(g \cdot \alpha / P(g) \cdot \alpha). \]

We remark that $\delta(g, S, \alpha) \geq 0$. When $\delta(g, S, \alpha) = 0$, the partitions $g \cdot \alpha$ and 
$P(g) \cdot \alpha$ are as independent as possible while respecting the fact that $G$ preserves 
the measure and while keeping $P(s^{-1} g) \cdot \alpha$ fixed. 
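
The ordering described above (shorter reduced words first, ties broken lexicographically) can be sketched in a few lines. This is only an illustration under assumptions: rank 2, inverses written as uppercase letters, and a hypothetical fixed ordering `ORDER` of the generators and their inverses. The helper `shifted_predecessors` computes the translate by s of the predecessor set of s^-1 g, where s is the first letter of g.

```python
ORDER = ["a", "A", "b", "B"]  # hypothetical fixed total ordering of S u S^-1 (rank 2)

def reduced_words(max_len):
    """All reduced words of length <= max_len; 'A' stands for the inverse of 'a'."""
    words, layer = [""], [""]
    for _ in range(max_len):
        layer = [w + s for w in layer for s in ORDER
                 if not w or w[-1] != s.swapcase()]
        words += layer
    return words

def key(word):
    """Sort key: shorter words first, ties broken lexicographically via ORDER."""
    return (len(word), [ORDER.index(s) for s in word])

def P(g):
    """The words strictly preceding g in the ordering."""
    return [h for h in reduced_words(len(g)) if key(h) < key(g)]

def left_mult(s, h):
    """Reduced word of the product s * h for a single letter s."""
    return h[1:] if h and h[0] == s.swapcase() else s + h

def shifted_predecessors(g):
    """s * P(s^-1 g), where s is the first letter of the non-empty word g."""
    s = g[0]
    return {left_mult(s, h) for h in P(g[1:])}
```

For every non-empty reduced word g of length at most 3, `shifted_predecessors(g)` is a subset of `P(g)`, which is the containment that makes the independence decay non-negative.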

Theorem 1.5. Let $G$ be a finitely generated free group acting on a standard 
probability space $(X, \mu)$ by measure preserving bijections. Assume that this action admits 
a generating partition $\alpha$ having finite Shannon entropy. If $S$ is a free generating 
set for $G$ then 

\[ f_G(X, \mu) = \mathrm{H}(\alpha) - \frac{1}{2} \cdot \sum_{1_G \neq g \in G} \delta(g, S, \alpha). \]

Organization. Notation, definitions, and some facts regarding f-invariant entropy 
are discussed in Section 2. In Section 3 we obtain two new formulas for f-invariant 
entropy and prove Theorem 1.5. We also prove that actions with finite f-invariant 
entropy cannot factor through a proper quotient of $G$. We study ergodic decompositions 
in Section 4 and prove Theorem 1.3 and Corollary 1.4. The main theorem of 
the paper, Theorem 1.1, is proved in Section 5. At the end of Section 5 we prove 
Corollary 1.2 and present a construction due to Lewis Bowen (Proposition 5.6). 

Acknowledgments. This material is based upon work supported by the National 
Science Foundation Graduate Student Research Fellowship under Grant No. DGE 
0718128. The author would like to thank his advisor, Ralf Spatzier, for helpful 
conversations. The author would also like to thank Lewis Bowen for sharing a 
construction (Proposition 5.6) which demonstrates that Corollary 1.2 cannot be 
strengthened to hold for all non-trivial subgroups. 

2. Preliminaries 

Let $G$ be a finitely generated free group and let $S$ be a free generating set for 
$G$. The rank of $G$ is the minimum size of a generating set for $G$, which in this case 
is simply $|S|$. If $g \in G$ then the reduced $S$-word representation of $g$ is the unique 
(possibly empty) sequence $(s_1, s_2, \ldots, s_n)$ where each $s_i \in S \cup S^{-1}$, $s_i \neq s_{i+1}^{-1}$, and 
$g = s_1 s_2 \cdots s_n$. The reduced $S$-word-length of $g \in G$, denoted $|g|$, is the length of 
the reduced $S$-word representation of $g$. We let $B_n = \{g \in G : |g| \leq n\}$ be the ball 
of radius $n$ and $S_n = B_n \setminus B_{n-1}$ be the sphere of radius $n$. We do not emphasize 
the dependence of $B_n$ and $S_n$ on $S$ as we will never use more than one generating 
set for $G$ simultaneously. The left $S$-Cayley graph of $G$ is the graph with vertex set 
$G$ and edge set $\{(g, sg) : g \in G, \ s \in S \cup S^{-1}\}$. 
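
To make the balls and spheres concrete, here is a small sketch that enumerates reduced words. The encoding is an assumption made for illustration: generators are lowercase letters and the inverse of a generator is its uppercase form; the function names are hypothetical.

```python
def letters(generators):
    """S u S^-1, writing the inverse of a lowercase generator as its uppercase."""
    return [x for g in generators for x in (g, g.upper())]

def sphere(generators, n):
    """Reduced words of length exactly n (no letter followed by its inverse)."""
    words = [""]
    for _ in range(n):
        words = [w + s for w in words for s in letters(generators)
                 if not w or w[-1] != s.swapcase()]
    return words

def ball(generators, n):
    """Reduced words of length at most n."""
    return [w for k in range(n + 1) for w in sphere(generators, k)]

# Sphere sizes for rank 2 and radii 0, 1, 2, 3.
rank2_sizes = [len(sphere("ab", n)) for n in range(4)]
```

For rank r and n at least 1 the sphere of radius n has 2r(2r - 1)^(n-1) elements, which is the source of the growth rate 2r - 1 that appears in Section 3.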

We will use the term probability space to always mean a standard Borel space 
equipped with a Borel probability measure. We will assume that all actions on 
probability spaces are by measure preserving bijections. If $G$ acts on $(X, \mu)$ then 
we let $\mathfrak{M}(X)$ denote the set of $G$-invariant Borel probability measures on $X$ and 
we let $\mathfrak{E}(X) \subseteq \mathfrak{M}(X)$ denote the ergodic measures. When needed, we will write 
$\mathfrak{M}_G(X)$ and $\mathfrak{E}_G(X)$ to distinguish the acting group. The set $\mathfrak{M}(X)$ is naturally 
a standard Borel space; its collection of Borel sets is defined to be the smallest 
$\sigma$-algebra making the maps $\nu \in \mathfrak{M}(X) \mapsto \nu(B)$ measurable for every Borel set 
$B \subseteq X$ [13, Theorem 17.24]. If $G$ acts on $(X, \mu)$ then the ergodic decomposition of 
$\mu$ is the unique Borel probability measure $\tau$ on $\mathfrak{M}(X)$ satisfying $\tau(\mathfrak{E}(X)) = 1$ and 
$\mu = \int \nu \, d\tau$ (meaning $\mu(B) = \int \nu(B) \, d\tau$ for every Borel set $B \subseteq X$). If $\{\alpha_i : i \in I\}$ 
is a finite collection of countable measurable partitions of $X$, then we let $\bigvee_{i \in I} \alpha_i$ 
denote the coarsest measurable partition of $X$ which is finer than each $\alpha_i$. If $I$ 
is infinite then we let $\bigvee_{i \in I} \alpha_i$ be the smallest $\sigma$-algebra containing every class of 
every $\alpha_i$. If $G \curvearrowright (X, \mu)$ and $\alpha = \{A_i : i \in I\}$ is a countable measurable partition 
of $X$, then we define $g \cdot \alpha = \{g \cdot A_i : i \in I\}$. For $F \subseteq G$ (finite or infinite) we 
define $F \cdot \alpha = \bigvee_{f \in F} f \cdot \alpha$. A countable measurable partition $\alpha$ is generating for 
$G \curvearrowright (X, \mu)$ if for every Borel set $B \subseteq X$ there is $B' \in G \cdot \alpha$ with $\mu(B \triangle B') = 0$. 
If $\alpha$ and $\beta$ are countable measurable partitions of $X$ then the conditional Shannon 
entropy of $\alpha$ relative to $\beta$ is 

\[ \mathrm{H}(\alpha / \beta) = \sum_{B \in \beta} \sum_{A \in \alpha} -\mu(A \cap B) \cdot \log\left( \frac{\mu(A \cap B)}{\mu(B)} \right). \]

If $\beta = \{X\}$ is the trivial partition then $\mathrm{H}(\alpha / \beta)$ equals the Shannon entropy of $\alpha$, 
$\mathrm{H}(\alpha)$, as defined in the introduction. When needed we write $\mathrm{H}_\mu(\alpha / \beta)$ and $\mathrm{H}_\mu(\alpha)$ to 
clarify the measure being used. If $(X, \mu)$ is a probability space then $\mathrm{H}(\mu)$ is defined 
as the supremum of $\mathrm{H}_\mu(\alpha)$ over all finite measurable partitions $\alpha$ of $X$. An atom 
of $\mu$ is a point $x \in X$ with $\mu(\{x\}) > 0$. A probability measure is purely atomic if 
the complement of the set of atoms has measure $0$. Since every probability space 
(in our sense) is isomorphic to an interval of the real line with Lebesgue measure 
together with a countable number of atoms, it is easy to show that $\mu$ is purely 
atomic if $\mathrm{H}(\mu) < \infty$. The converse does not hold. 

The following lemma consists of some well known facts on Shannon entropy 
which we will need. The reader can consult [53] for a proof. 

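Lemma 2.1. Let $(X, \mu)$ be a standard probability space, and let $\alpha, \beta, \xi$ be countable 
measurable partitions of $X$. Then 

(i) $\mathrm{H}(\alpha / \beta) \geq 0$; 

(ii) $\mathrm{H}(\alpha \vee \beta) = \mathrm{H}(\alpha / \beta) + \mathrm{H}(\beta)$; 

(iii) $\mathrm{H}(\alpha \vee \beta / \xi) = \mathrm{H}(\alpha / \beta \vee \xi) + \mathrm{H}(\beta / \xi)$; 

(iv) $\mathrm{H}(\alpha / \beta \vee \xi) \leq \mathrm{H}(\alpha / \beta)$; 

(v) $\mathrm{H}(\alpha \vee \xi / \beta \vee \xi) = \mathrm{H}(\alpha / \beta \vee \xi)$. 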
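
The standard identities in this lemma are easy to check numerically on a finite example. The sketch below verifies non-negativity of conditional entropy and the chain rule for a join on a made-up joint distribution of two two-cell partitions; the numbers and the helper names are assumptions for illustration only.

```python
import math

def entropy(dist):
    """Shannon entropy of a finite distribution given as {cell: probability}."""
    return -sum(p * math.log(p) for p in dist.values() if p > 0)

def marginal(joint, axis):
    """Distribution of one of the two partitions (axis 0 or axis 1)."""
    out = {}
    for cell, p in joint.items():
        out[cell[axis]] = out.get(cell[axis], 0.0) + p
    return out

def conditional_entropy(joint):
    """Entropy of the first partition relative to the second.

    joint[(A, B)] is the measure of the intersection of the cells A and B.
    """
    second = marginal(joint, 1)
    return -sum(p * math.log(p / second[b])
                for (a, b), p in joint.items() if p > 0)

# Hypothetical joint distribution of two partitions with two cells each.
joint = {("A1", "B1"): 0.4, ("A1", "B2"): 0.1,
         ("A2", "B1"): 0.2, ("A2", "B2"): 0.3}

join_entropy = entropy(joint)
chain_rule_rhs = conditional_entropy(joint) + entropy(marginal(joint, 1))
```

Here `join_entropy` and `chain_rule_rhs` agree up to rounding, and the conditional entropy lies between 0 and the entropy of the first partition.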

We will later need the following theorem which places a restriction on the f- 
invariant entropy of factors. 

Theorem 2.2 (Bowen, [6]). Let $G$ be a finitely generated free group acting on a 
probability space $(X, \mu)$. Assume that this action admits a generating partition $\alpha$ 
with $\mathrm{H}(\alpha) < \infty$. If $(Y, \nu)$ is a factor of $(X, \mu)$ and $f_G(Y, \nu)$ is defined, then 

\[ f_G(Y, \nu) \geq f_G(X, \mu) - \mathrm{H}(\alpha). \]

We also observe a simple lemma for later reference. 

Lemma 2.3. Let $G$ be a finitely generated free group of rank $r$ acting on a 
probability space $(X, \mu)$. If $\mathrm{H}(\mu) < \infty$ then $f_G(X, \mu) = -(r - 1) \mathrm{H}(\mu)$. 

Proof. As $\mathrm{H}(\mu) < \infty$, $\mu$ must be purely atomic. We can therefore partition $X$ 
so that every atom of $\mu$ is a class of this partition. Call this partition $\alpha$. One 
readily has $\mathrm{H}(\alpha) = \mathrm{H}(\mu) < \infty$ and $\alpha$ is generating. Also, for finite $K \subseteq G$ the 
partitions $K \cdot \alpha$ and $\alpha$ are, modulo sets of measure zero, identical. Therefore 
$F_G(X, \mu, S, K \cdot \alpha) = -(r - 1) \mathrm{H}(\alpha) = -(r - 1) \mathrm{H}(\mu)$ for every finite $K \subseteq G$. □ 

We point out that since $-(r - 1) \cdot \mathrm{H}(\mu) \leq 0$, our main theorem implies that any 
ergodic action with $f_G(X, \mu) > 0$ is essentially free. 

3. F-Invariant Entropy Via Independence Decay 

In this section we prove that f-invariant entropy can be computed from what we 
call independence decay. Obtaining this new formula for f-invariant entropy is a 
key ingredient to the proof of the main theorem. 

We begin with a simple lemma which will allow us to simplify the formula for 
f-invariant entropy. 

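Lemma 3.1. Let $G$ be a finitely generated free group of rank $r$ and let $S$ be a free 
generating set for $G$. If $G$ acts on a probability space $(X, \mu)$ and $\alpha$ is a countable 
measurable partition of $X$ then 

\[ \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) \leq \sum_{s \in S \cup S^{-1}} \mathrm{H}(s B_n \cdot \alpha / B_n \cdot \alpha) \]

and 

\[ \sum_{s \in S \cup S^{-1}} \mathrm{H}(s B_{n+1} \cdot \alpha / B_{n+1} \cdot \alpha) \leq (2r - 1) \cdot \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha). \]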

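Proof. We begin with the first inequality. Enumerate $S \cup S^{-1}$ as $t_1, t_2, \ldots, t_{2r}$. Set 

\[ A_i = \bigcup_{j < i} t_j B_n. \]

Since $B_{n+1}$ is the union of the $t_i B_n$'s, we have by Lemma 2.1 

\begin{align*}
\mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) &= \sum_{i=1}^{2r} \mathrm{H}(t_i B_n \cdot \alpha / B_n \cdot \alpha \vee A_i \cdot \alpha) \\
&\leq \sum_{i=1}^{2r} \mathrm{H}(t_i B_n \cdot \alpha / B_n \cdot \alpha) = \sum_{s \in S \cup S^{-1}} \mathrm{H}(s B_n \cdot \alpha / B_n \cdot \alpha).
\end{align*}

Thus we have the first inequality. 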

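Now we consider the second inequality. Fix $s \in S \cup S^{-1}$. Set 

\[ C_s = \Big( \bigcup_{t \neq s^{-1}} t B_n \Big) \setminus B_n \subseteq S_{n+1}. \]

So $C_s$ is the set of $g \in S_{n+1}$ whose reduced $S$-word representations do not begin on 
the left with $s^{-1}$. Notice that every $g \in S_{n+1}$ lies in precisely $(2r - 1)$ many $C_s$'s. 
Also notice that 

\[ s B_{n+1} \setminus B_{n+1} = s C_s. \]

Fix a total ordering, $\leq$, of $S_{n+1}$. For $g \in S_{n+1}$, let $P(g)$ be the set of elements of 
$S_{n+1}$ which strictly precede $g$. For $g \in C_s$ we may not have $P(g) \subseteq C_s$, however 
we do have 

\[ s P(g) \subseteq B_n \cup s \cdot (P(g) \cap C_s). \]

So by Lemma 2.1 

\begin{align*}
\mathrm{H}(s B_{n+1} \cdot \alpha / B_{n+1} \cdot \alpha) &= \mathrm{H}(s C_s \cdot \alpha / B_{n+1} \cdot \alpha) \\
&= \sum_{g \in C_s} \mathrm{H}(s g \cdot \alpha / B_{n+1} \cdot \alpha \vee s (P(g) \cap C_s) \cdot \alpha) = \sum_{g \in C_s} \mathrm{H}(s g \cdot \alpha / B_{n+1} \cdot \alpha \vee s P(g) \cdot \alpha) \\
&\leq \sum_{g \in C_s} \mathrm{H}(s g \cdot \alpha / s B_n \cdot \alpha \vee s P(g) \cdot \alpha) = \sum_{g \in C_s} \mathrm{H}(g \cdot \alpha / B_n \cdot \alpha \vee P(g) \cdot \alpha).
\end{align*}

Therefore 

\begin{align*}
\sum_{s \in S \cup S^{-1}} \mathrm{H}(s B_{n+1} \cdot \alpha / B_{n+1} \cdot \alpha) &\leq \sum_{s \in S \cup S^{-1}} \sum_{g \in C_s} \mathrm{H}(g \cdot \alpha / B_n \cdot \alpha \vee P(g) \cdot \alpha) \\
&= (2r - 1) \sum_{g \in S_{n+1}} \mathrm{H}(g \cdot \alpha / B_n \cdot \alpha \vee P(g) \cdot \alpha) = (2r - 1) \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha).
\end{align*}

This completes the proof. □ 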

We now obtain a somewhat simpler formula for f-invariant entropy. 

Lemma 3.2. Let $G$ be a finitely generated free group of rank $r$ and let $S$ be a free 
generating set for $G$. Let $G$ act on a probability space $(X, \mu)$. Assume that there is 
a generating partition $\alpha$ having finite Shannon entropy. Then 

\[ f_G(X, \mu) = \lim_{n \to \infty} (1 - r) \cdot \mathrm{H}(B_n \cdot \alpha) + \frac{1}{2} \cdot \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha). \]

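Proof. Define 

\[ F'_G(X, \mu, S, \alpha, n) = (1 - r) \cdot \mathrm{H}(B_n \cdot \alpha) + \frac{1}{2} \cdot \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha), \]
\[ f'_G(X, \mu) = \lim_{n \to \infty} F'_G(X, \mu, S, \alpha, n). \]

Since the action of $G$ preserves measure we have 

\begin{align*}
F_G(X, \mu, S, B_n \cdot \alpha) &= (1 - 2r) \mathrm{H}(B_n \cdot \alpha) + \sum_{s \in S} \mathrm{H}(s B_n \cdot \alpha \vee B_n \cdot \alpha) \\
&= (1 - 2r) \mathrm{H}(B_n \cdot \alpha) + \frac{1}{2} \cdot \sum_{s \in S \cup S^{-1}} \mathrm{H}(s B_n \cdot \alpha \vee B_n \cdot \alpha) \\
&= (1 - r) \mathrm{H}(B_n \cdot \alpha) + \frac{1}{2} \cdot \sum_{s \in S \cup S^{-1}} \mathrm{H}(s B_n \cdot \alpha / B_n \cdot \alpha),
\end{align*}

where the last equality follows from Lemma 2.1. So by the first inequality of Lemma 
3.1 we have 

\[ F'_G(X, \mu, S, \alpha, n) \leq F_G(X, \mu, S, B_n \cdot \alpha) \]

for every $n \in \mathbb{N}$. Thus $f'_G(X, \mu) \leq f_G(X, \mu)$. 

If $f_G(X, \mu) = -\infty$, then we have $f'_G(X, \mu) = f_G(X, \mu)$ as claimed. So suppose 
that $f_G(X, \mu) \neq -\infty$. Then we have 

\begin{align*}
0 &= \lim_{n \to \infty} 2 \cdot F_G(X, \mu, S, B_n \cdot \alpha) - 2 \cdot F_G(X, \mu, S, B_{n+1} \cdot \alpha) \\
&= \lim_{n \to \infty} 2(1 - r) \mathrm{H}(B_n \cdot \alpha) - 2(1 - r) \mathrm{H}(B_{n+1} \cdot \alpha) \\
&\qquad + \sum_{s \in S \cup S^{-1}} \left[ \mathrm{H}(s B_n \cdot \alpha / B_n \cdot \alpha) - \mathrm{H}(s B_{n+1} \cdot \alpha / B_{n+1} \cdot \alpha) \right] \\
&= \lim_{n \to \infty} (2r - 2) \cdot \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) \\
&\qquad + \sum_{s \in S \cup S^{-1}} \left[ \mathrm{H}(s B_n \cdot \alpha / B_n \cdot \alpha) - \mathrm{H}(s B_{n+1} \cdot \alpha / B_{n+1} \cdot \alpha) \right] \\
&= \lim_{n \to \infty} \sum_{s \in S \cup S^{-1}} \mathrm{H}(s B_n \cdot \alpha / B_n \cdot \alpha) - \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) \\
&\qquad + (2r - 1) \cdot \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) - \sum_{s \in S \cup S^{-1}} \mathrm{H}(s B_{n+1} \cdot \alpha / B_{n+1} \cdot \alpha).
\end{align*}

The expression appearing in the last line and the expression appearing in the second 
to last line are both non-negative by Lemma 3.1. Since the limit is $0$, we must have 
that $f_G(X, \mu) \neq -\infty$ implies 

\[ \lim_{n \to \infty} \sum_{s \in S \cup S^{-1}} \mathrm{H}(s B_n \cdot \alpha / B_n \cdot \alpha) - \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) = 0, \tag{3.1} \]

and 

\[ \lim_{n \to \infty} (2r - 1) \cdot \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) - \sum_{s \in S \cup S^{-1}} \mathrm{H}(s B_{n+1} \cdot \alpha / B_{n+1} \cdot \alpha) = 0. \]

In particular, when $f_G(X, \mu) \neq -\infty$ we have 

\[ \lim_{n \to \infty} F_G(X, \mu, S, B_n \cdot \alpha) - F'_G(X, \mu, S, \alpha, n) = 0 \]

by Equation 3.1. Thus $f'_G(X, \mu) = f_G(X, \mu)$ in all cases. □ 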

Corollary 3.3. Let $G$ be a finitely generated free group of rank $r > 1$ and let $S$ be 
a free generating set for $G$. Let $G$ act on a probability space $(X, \mu)$. Assume that 
there is a generating partition $\alpha$ having finite Shannon entropy. If $f_G(X, \mu) \neq -\infty$ 
and $\mu$ is not purely atomic then 

\[ \lim_{n \to \infty} \sqrt[n]{\mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha)} = 2r - 1. \]

We mention that only a weak form of this corollary is needed to establish the 
main results of this paper. The weaker fact which is needed is that the assumptions 
above imply that $\mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha)$ tends to infinity. This weaker fact is established 
in the first paragraph of the proof below, and the reader may skip the rest of the 
proof if desired. The full strength of the above corollary is only used in proving 
Corollary 3.4, and that corollary is independent of the rest of the paper. 

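Proof. Since $\mu$ is not purely atomic and $\alpha$ is generating we must have 

\[ \lim_{n \to \infty} \mathrm{H}(B_n \cdot \alpha) = \mathrm{H}(\mu) = +\infty. \]

By Lemma 3.2 we have 

\[ 0 = \lim_{n \to \infty} (1 - r) \cdot \mathrm{H}(B_n \cdot \alpha) + \frac{1}{2} \cdot \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) - f_G(X, \mu). \]

Since $r > 1$ it follows that 

\[ \lim_{n \to \infty} \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) = +\infty. \]

In the proof of the previous lemma we showed that when $f_G(X, \mu) \neq -\infty$ we 
have 

\[ \lim_{n \to \infty} \sum_{s \in S \cup S^{-1}} \mathrm{H}(s B_n \cdot \alpha / B_n \cdot \alpha) - \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) = 0, \]

and 

\[ \lim_{n \to \infty} (2r - 1) \cdot \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) - \sum_{s \in S \cup S^{-1}} \mathrm{H}(s B_{n+1} \cdot \alpha / B_{n+1} \cdot \alpha) = 0. \]

From these two equations it follows that 

\[ \lim_{n \to \infty} (2r - 1) \cdot \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) - \mathrm{H}(B_{n+2} \cdot \alpha / B_{n+1} \cdot \alpha) = 0. \]

Lemma 3.1 implies that each term in the above limit is non-negative. By the 
previous paragraph, we can find $M \in \mathbb{N}$ so that $\mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) > 1$ for all 
$n \geq M$. Fix $0 < \epsilon < 1$ and let $N \geq M$ be such that 

\[ 0 \leq (2r - 1) \cdot \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) - \mathrm{H}(B_{n+2} \cdot \alpha / B_{n+1} \cdot \alpha) < \epsilon \]

for all $n \geq N$. So for $n \geq N$ 

\[ (2r - 1) \cdot \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) - \epsilon < \mathrm{H}(B_{n+2} \cdot \alpha / B_{n+1} \cdot \alpha) \leq (2r - 1) \cdot \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha). \]

Since this holds for all $n \geq N$, by induction we have 

\begin{align*}
(2r - 1)^m \cdot \mathrm{H}(B_{N+1} \cdot \alpha / B_N \cdot \alpha) &\geq \mathrm{H}(B_{N+m+1} \cdot \alpha / B_{N+m} \cdot \alpha) \\
&> (2r - 1)^m \cdot \mathrm{H}(B_{N+1} \cdot \alpha / B_N \cdot \alpha) - \epsilon \cdot \sum_{i=0}^{m-1} (2r - 1)^i \\
&= (2r - 1)^m \cdot \mathrm{H}(B_{N+1} \cdot \alpha / B_N \cdot \alpha) - \frac{\epsilon}{2r - 2} \cdot \left( (2r - 1)^m - 1 \right) \\
&= (2r - 1)^m \cdot \left( \mathrm{H}(B_{N+1} \cdot \alpha / B_N \cdot \alpha) - \frac{\epsilon}{2r - 2} \right) + \frac{\epsilon}{2r - 2}.
\end{align*}

Since $N \geq M$ we have $\mathrm{H}(B_{N+1} \cdot \alpha / B_N \cdot \alpha) - \frac{\epsilon}{2r - 2} > 0$. So we obtain 

\[ \lim_{m \to \infty} \sqrt[N+m]{\mathrm{H}(B_{N+m+1} \cdot \alpha / B_{N+m} \cdot \alpha)} = 2r - 1. \]

This completes the proof. □ 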
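
The induction in the proof above can be checked numerically in the boundary case where every step loses exactly epsilon, that is, for the recursion a -> (2r - 1) a - epsilon. The sketch below compares the iterates with the closed form from the geometric sum and looks at the root of the final term. The parameter values are assumptions chosen so that the starting value exceeds 1 and epsilon lies in (0, 1).

```python
import math

def boundary_sequence(r, a0, eps, steps):
    """Iterate a -> (2r - 1) * a - eps, the boundary case of the two-sided bound."""
    values = [a0]
    for _ in range(steps):
        values.append((2 * r - 1) * values[-1] - eps)
    return values

r, a0, eps, steps = 2, 1.5, 0.5, 200  # assumed illustrative parameters
seq = boundary_sequence(r, a0, eps, steps)

# Closed form obtained from the geometric sum in the proof.
closed_form = (2 * r - 1) ** steps * (a0 - eps / (2 * r - 2)) + eps / (2 * r - 2)

# Root of the last term; it approaches 2r - 1 as steps grows.
root = seq[-1] ** (1.0 / steps)
```

With these values the iterates match the closed form up to floating point error and `root` is within 0.01 of 2r - 1 = 3.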

The previous corollary allows a simple proof that free group actions which factor 
through a proper quotient of G must have f-invariant entropy negative infinity, 
provided it is defined and the space is not purely atomic. This is an extremely 
weak version of our main theorem and is itself a new result. We include this 
corollary because it is quite interesting and its proof is substantially simpler than 
the proof of the main theorem. 

Corollary 3.4. Let $G$ be a finitely generated non-cyclic free group and let $K \lhd G$ 
be a non-trivial normal subgroup. Let $G$ act on a probability space $(X, \mu)$. Assume 
that $f_G(X, \mu)$ is defined. If this action factors through $G/K$ and $\mu$ is not purely 
atomic then $f_G(X, \mu) = -\infty$. 

Proof. For clarification, we say the action factors through $G/K$ to mean that $K$ 
stabilizes every $x \in X$. Towards a contradiction, suppose that $f_G(X, \mu) \neq -\infty$. 
Let $\alpha$ be a generating partition with $\mathrm{H}(\alpha) < \infty$. Let $\phi : G \to G/K$ be the factor 
map and let $r$ be the rank of $G$. By [T^l Section B], the exponential growth rate of 
$G/K$ is strictly less than the exponential growth rate of $G$: 

\[ \lim_{n \to \infty} \sqrt[n]{|\phi(B_n)|} < 2r - 1. \]

The number of distinct partitions among $g \cdot \alpha$ with $g \in B_n$ is at most $|\phi(B_n)|$. So 

\[ \mathrm{H}(B_n \cdot \alpha / B_{n-1} \cdot \alpha) \leq \mathrm{H}(B_n \cdot \alpha) \leq |\phi(B_n)| \cdot \mathrm{H}(\alpha). \]

After taking roots and applying the previous corollary we obtain 

\[ 2r - 1 = \lim_{n \to \infty} \sqrt[n]{\mathrm{H}(B_n \cdot \alpha / B_{n-1} \cdot \alpha)} \leq \lim_{n \to \infty} \sqrt[n]{|\phi(B_n)| \cdot \mathrm{H}(\alpha)} < 2r - 1, \]

a contradiction. □ 
The above technique may work in the more general setting where 

\[ \limsup_{n \to \infty} \sqrt[n]{|B_n \cdot x|} \]

is strictly less than $2r - 1$ for $\mu$-almost every $x \in X$. However, recent work of M. 
Abert, Y. Glasner, and B. Virag [2, Theorem 8], [2, Proposition 14] shows that for 
any finitely generated free group $G \neq \mathbb{Z}$ there exists a measure preserving action 
of $G$ on a standard probability space $(X, \mu)$ such that the stabilizer of $x$ is 
non-trivial and the above limit equals $2r - 1$ for $\mu$-almost every $x \in X$. Therefore we 
cannot rely on the above technique to prove our main theorem. Instead, we will 
obtain tighter control over f-invariant entropy by refining the formula for f-invariant 
entropy found in Lemma 3.2. 

We remark that the result of Abert-Glasner-Virag discussed above again shows 
that stabilizers occurring in probability measure preserving actions of free groups 
can be quite bizarre. Our main theorem therefore demonstrates a significant re- 
striction imposed by having finite f-invariant entropy. 

In the next theorem we show that f-invariant entropy can be defined in terms of 
independence decay. We refer the reader to the introduction for the definition of 
independence decay. Technically $\delta(g, S, \alpha)$ depends on a total ordering of $S \cup S^{-1}$, 
but we do not emphasize this dependence. Notice that if $1_G \neq g \in G$ begins on the 
left with $s \in S \cup S^{-1}$ in its reduced $S$-word representation then 

\[ s \cdot P(s^{-1} g) \subseteq P(g). \]

Therefore 

\begin{align*}
\delta(g, S, \alpha) &= \mathrm{H}(s^{-1} g \cdot \alpha / P(s^{-1} g) \cdot \alpha) - \mathrm{H}(g \cdot \alpha / P(g) \cdot \alpha) \\
&= \mathrm{H}(g \cdot \alpha / s P(s^{-1} g) \cdot \alpha) - \mathrm{H}(g \cdot \alpha / P(g) \cdot \alpha) \geq 0,
\end{align*}

as claimed in the introduction. Furthermore, from the equation above we see that 
$\delta(g, S, \alpha)$ measures how much $g \cdot \alpha$ depends on $(P(g) \setminus s P(s^{-1} g)) \cdot \alpha$ when conditioned 
on $s P(s^{-1} g) \cdot \alpha$. So the independence decay at $g$ is $0$ when the partitions $g \cdot \alpha$ and 
$(P(g) \setminus s P(s^{-1} g)) \cdot \alpha$ are independent when conditioned on $s P(s^{-1} g) \cdot \alpha$. 

Theorem 3.5. Let $G$ be a finitely generated free group, let $S$ be a free generating 
set for $G$, and let $G$ act on a probability space $(X, \mu)$. Assume that there is a 
generating partition $\alpha$ having finite Shannon entropy. Then 

\[ f_G(X, \mu) = \mathrm{H}(\alpha) - \frac{1}{2} \cdot \sum_{1_G \neq g \in G} \delta(g, S, \alpha). \]

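Proof. Let $r$ denote the rank of $G$. As in the proof of Lemma 3.2, define 

\[ F'_G(X, \mu, S, \alpha, n) = (1 - r) \cdot \mathrm{H}(B_n \cdot \alpha) + \frac{1}{2} \cdot \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha). \]

We have 

\begin{align*}
F'_G(X, \mu, S, \alpha, 0) &= (1 - r) \mathrm{H}(\alpha) + \frac{1}{2} \cdot \mathrm{H}(B_1 \cdot \alpha / \alpha) \\
&= \mathrm{H}(\alpha) - \frac{1}{2} \cdot \left[ 2r \cdot \mathrm{H}(\alpha) - \mathrm{H}(B_1 \cdot \alpha / \alpha) \right].
\end{align*}

The difference between consecutive $F'_G$ terms can be rewritten in a similar manner: 

\begin{align*}
& 2 \cdot F'_G(X, \mu, S, \alpha, n) - 2 \cdot F'_G(X, \mu, S, \alpha, n + 1) \\
&= 2(1 - r) \mathrm{H}(B_n \cdot \alpha) + \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) - 2(1 - r) \mathrm{H}(B_{n+1} \cdot \alpha) - \mathrm{H}(B_{n+2} \cdot \alpha / B_{n+1} \cdot \alpha) \\
&= (2r - 1) \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) - \mathrm{H}(B_{n+2} \cdot \alpha / B_{n+1} \cdot \alpha).
\end{align*}

Putting these together and using Lemma 3.2 we have 

\begin{align*}
f_G(X, \mu) &= F'_G(X, \mu, S, \alpha, 0) - \frac{1}{2} \cdot \sum_{n=1}^{\infty} \left[ 2 \cdot F'_G(X, \mu, S, \alpha, n - 1) - 2 \cdot F'_G(X, \mu, S, \alpha, n) \right] \\
&= \mathrm{H}(\alpha) - \frac{1}{2} \cdot \left[ 2r \cdot \mathrm{H}(\alpha) - \mathrm{H}(B_1 \cdot \alpha / \alpha) \right] \\
&\qquad - \frac{1}{2} \cdot \sum_{n=1}^{\infty} \left[ (2r - 1) \mathrm{H}(B_n \cdot \alpha / B_{n-1} \cdot \alpha) - \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha) \right].
\end{align*}

Let $|g|$ denote the reduced $S$-word length of $g \in G$. Since $|g| < |h|$ implies $g \preceq h$ 
it follows from Lemma 2.1 that 

\[ \mathrm{H}(B_n \cdot \alpha / B_{n-1} \cdot \alpha) = \sum_{g \in S_n} \mathrm{H}(g \cdot \alpha / P(g) \cdot \alpha). \]

Since $|S_1| = 2r$ we have 

\[ \sum_{g \in S_1} \delta(g, S, \alpha) = 2r \cdot \mathrm{H}(\alpha) - \mathrm{H}(B_1 \cdot \alpha / \alpha). \]

Similarly, for $n \geq 1$ each element of $S_n$ is adjacent to $2r - 1$ many points of $S_{n+1}$ 
in the left $S$-Cayley graph of $G$. So 

\[ \sum_{g \in S_{n+1}} \delta(g, S, \alpha) = (2r - 1) \mathrm{H}(B_n \cdot \alpha / B_{n-1} \cdot \alpha) - \mathrm{H}(B_{n+1} \cdot \alpha / B_n \cdot \alpha). \]

It follows from the equation at the end of the previous paragraph that 

\begin{align*}
f_G(X, \mu) &= \mathrm{H}(\alpha) - \frac{1}{2} \cdot \sum_{g \in S_1} \delta(g, S, \alpha) - \frac{1}{2} \cdot \sum_{n=1}^{\infty} \sum_{g \in S_{n+1}} \delta(g, S, \alpha) \\
&= \mathrm{H}(\alpha) - \frac{1}{2} \cdot \sum_{1_G \neq g \in G} \delta(g, S, \alpha).
\end{align*}

□ 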
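
The counting step used in the proof above (each element of the sphere of radius n, for n at least 1, has exactly 2r - 1 neighbours in the next sphere of the left Cayley graph) can be confirmed by brute force for small radii. The sketch assumes rank 2 and writes inverses as uppercase letters; all names are hypothetical.

```python
LETTERS = ["a", "A", "b", "B"]  # S u S^-1 for an assumed rank r = 2

def sphere(n):
    """Reduced words of length exactly n."""
    words = [""]
    for _ in range(n):
        words = [w + s for w in words for s in LETTERS
                 if not w or w[-1] != s.swapcase()]
    return words

def left_mult(s, w):
    """Reduced word of the product s * w for a single letter s."""
    return w[1:] if w and w[0] == s.swapcase() else s + w

def outward_neighbours(w):
    """Neighbours s * w of w in the left Cayley graph that are one letter longer."""
    return [left_mult(s, w) for s in LETTERS
            if len(left_mult(s, w)) == len(w) + 1]

# Number of outward neighbours, collected over the spheres of radius 1, 2 and 3.
counts = {len(outward_neighbours(w)) for n in (1, 2, 3) for w in sphere(n)}
```

Every non-identity element has 2r - 1 = 3 outward neighbours, while the identity has 2r = 4, which is why the sphere of radius 1 is treated separately in the proof.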

The above theorem implies that when $f_G(X, \mu) \neq -\infty$, $\delta(g, S, \alpha)$ tends to $0$ 
as the reduced $S$-word length of $g$ tends to infinity. We mention that the values 
$\delta(g, S, \alpha)$ do not satisfy any monotone properties: the independence decay can 
be zero for a very long time and then become positive, and in fact it can fluctuate 
between being positive and zero. If $\delta(h, S, \alpha) = 0$ for all $h \succeq g$, then $(X, \mu)$ is 
measurably conjugate to a Markov process, and the Markov partition is $P(g) \cdot \alpha$ 
(see [7]). In fact, since $P(g)$ is left $S$-connected, one can always model $(X, \mu)$ by an 
action for which the independence decay at $h$ is $0$ for every $h \succeq g$ (this is called a 
Markov approximation to the action, see [20]). One could take the viewpoint that 
independence decay measures the error in these Markov approximations. Then the 
above theorem would say that if $f_G(X, \mu) \neq -\infty$ then these Markov approximations 
converge rapidly enough to the action so that the errors are summable. 

The following simple lemma will be used in the proof of the main theorem. 
Lemma 3.6. Let $G$ be a finitely generated free group, let $S$ be a free generating 
set for $G$, and let $|g|$ denote the reduced $S$-word length of $g \in G$. Let $G$ act on a 
probability space $(X, \mu)$. Assume that there is a generating partition $\alpha$ with 
$\mathrm{H}(\alpha) < \infty$. If $p_0, p_1, \ldots, p_n$ is a path in the left $S$-Cayley graph of $G$ with 
$|p_0| < |p_1| < \cdots < |p_n|$, then 

\[ \sum_{i=1}^{n} \delta(p_i, S, \alpha) = \mathrm{H}(p_0 \cdot \alpha / P(p_0) \cdot \alpha) - \mathrm{H}(p_n \cdot \alpha / P(p_n) \cdot \alpha). \]

Proof. The conditions on the $p_i$ imply that 

\[ \delta(p_i, S, \alpha) = \mathrm{H}(p_{i-1} \cdot \alpha / P(p_{i-1}) \cdot \alpha) - \mathrm{H}(p_i \cdot \alpha / P(p_i) \cdot \alpha). \]

It follows that all intermediary terms cancel. □ 

4. Ergodic Decomposition 

In this section we relate the f-invariant entropy of an invariant measure to the 
f-invariant entropy of the ergodic components of the invariant measure. We first 
show that f-invariant entropy is defined on the ergodic components, however we 
will find it necessary to work in a slightly more general setting. 

Definition 4.1. Let $G$ be a countable group acting on a probability space $(X, \mu)$. 
A decomposition of $\mu$ is a Borel probability measure $\tau$ on $\mathfrak{M}(X)$ such that 
$\mu = \int_{m \in \mathfrak{M}(X)} m \, d\tau$. A decomposition is countable if it is purely atomic. We say that $\tau$ 
is a decomposition of $\mu$ into mutually singular measures if $\tau \times \tau$-almost every pair 
of measures $(m, \lambda) \in \mathfrak{M}(X) \times \mathfrak{M}(X)$ are either identical ($m = \lambda$) or are mutually 
singular, meaning there is a Borel set $B \subseteq X$ with $m(B) = \lambda(X \setminus B) = 1$. 

It is well known that distinct ergodic measures are mutually singular. So ergodic 
decompositions are decompositions into mutually singular measures. 

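Lemma 4.2. Let $G$ be a countable group acting on a probability space $(X, \mu)$. Let 
$\tau$ be a decomposition of $\mu$ into mutually singular measures. If $\alpha$ is a countable 
measurable partition of $X$ then: 

(i) if $\alpha$ is generating for $G \curvearrowright (X, \mu)$ then $\alpha$ is generating for $G \curvearrowright (X, \nu)$ for 
$\tau$-almost every $\nu \in \mathfrak{M}(X)$; 

(ii) if $\mathrm{H}_\mu(\alpha) < \infty$ then $\mathrm{H}_\nu(\alpha) < \infty$ for $\tau$-almost every $\nu \in \mathfrak{M}(X)$. Furthermore 
$\int \mathrm{H}_\nu(\alpha) \, d\tau \leq \mathrm{H}_\mu(\alpha)$. 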

Proof, (i). Since X is by assumption a standard Borel space, there is a countable 
collection of Borel sets C such that the cr-algebra generated by C is precisely the 
collection of all Borel subsets of X. Since a is generating for G rv (X, p), we have 
that for every C £ C there is a paired set p(C) £ G ■ a with /j,(CAp(C)) = 0. We 
have 

= p(CAp(C)) = [ v{CAp{C))dT. 
Jvem(x) 

So v(C Ap(C)) = for r-almost every v £ 9Jl(X). Since C is countable, we can 
find a single set M C m{X) with r(M) = 1 for which v{C Ap(C)) = for every 
v £ M and every C £ C. Now let v £ M and let T be the collection of Borel sets 
B C X for which there is B' £ G ■ a with v(BAB') = 0. It is easily verified that 
J 7 is a a- algebra, and by definition of M we have T contains C. Therefore J* is the 
collection of Borel sets and a is a generating partition for G rv (X, v) . 
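(ii). From the definition of the Borel structure on $\mathfrak{M}(X)$ (see Section 2), it is apparent that the functions $\nu \mapsto -\nu(A) \log(\nu(A))$ are Borel for $A \subseteq X$ Borel. Therefore $\nu \mapsto H_\nu(\alpha)$ is a Borel function on $\mathfrak{M}(X)$. So the integral $\int H_\nu(\alpha)\, d\tau$ is defined since $H_\nu(\alpha) \ge 0$ for all $\nu$. By the Monotone Convergence Theorem we have

$$\int H_\nu(\alpha)\, d\tau = \sum_{A \in \alpha} \int -\nu(A) \log(\nu(A))\, d\tau.$$

Define $\phi : [0, 1] \to \mathbb{R}$ by $\phi(0) = 0$ and $\phi(t) = t \cdot \log(t)$ for $0 < t \le 1$. One can easily check that the function $\phi$ is convex. Since the function $\nu \mapsto \nu(A)$ clearly lies in $L^1(\mathfrak{M}(X), \tau)$, we can apply Jensen's Inequality to obtain

$$\sum_{A \in \alpha} \int -\phi(\nu(A))\, d\tau \le \sum_{A \in \alpha} -\phi\left( \int \nu(A)\, d\tau \right) = \sum_{A \in \alpha} -\phi(\mu(A)) = H_\mu(\alpha).$$

We conclude that if $H_\mu(\alpha) < \infty$ then $H_\nu(\alpha) < \infty$ for $\tau$-almost every $\nu \in \mathfrak{M}(X)$. □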
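The inequality in part (ii) is an instance of the concavity of Shannon entropy, and it can be checked numerically on a finite toy example. The following is a minimal sketch, not part of the paper: the weights, the two measures, and the helper names `shannon_entropy` and `mixture` are illustrative assumptions. Mutual singularity is not needed for this inequality, so the toy measures are allowed to overlap.

```python
import math

def shannon_entropy(probs):
    """Shannon entropy (natural log) of a probability vector; 0*log(0) counts as 0."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def mixture(weights, measures):
    """Weighted average of probability vectors of equal length."""
    size = len(measures[0])
    return [sum(w * m[i] for w, m in zip(weights, measures)) for i in range(size)]

# Hypothetical data: a two-atom decomposition tau with weights 0.3 and 0.7,
# and the values nu(A) on a partition alpha with three classes.
weights = [0.3, 0.7]
measures = [[0.5, 0.5, 0.0], [0.1, 0.2, 0.7]]
mu = mixture(weights, measures)

# Left side: integral of H_nu(alpha) against tau. Right side: H_mu(alpha).
average_entropy = sum(w * shannon_entropy(m) for w, m in zip(weights, measures))
entropy_of_average = shannon_entropy(mu)
print(average_entropy <= entropy_of_average)
```

The comparison prints whether the averaged entropy is bounded by the entropy of the averaged measure, which is what Jensen's Inequality predicts.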

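Theorem 4.3. Let $G$ be a finitely generated free group of rank $r$ acting on a probability space $(X, \mu)$. Assume that $f_G(X, \mu)$ is defined. If $\tau$ is the ergodic decomposition of $\mu$, then $f_G(X, \nu)$ is defined for $\tau$-almost every $\nu \in \mathfrak{E}(X)$ and

$$f_G(X, \mu) = \int_{\nu \in \mathfrak{E}(X)} f_G(X, \nu)\, d\tau - (r - 1) H(\tau).$$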



Proof. This is well known when $G = \mathbb{Z}$ ([23, Theorem 8.4]). So we assume below that $r > 1$.

We first treat a special case. Let $\tau$ be a countable decomposition of $\mu$ (not necessarily the ergodic decomposition) into mutually singular measures with $H(\tau) < \infty$. Say the atoms of $\tau$ are $\nu_i$, $i \in I$, for some countable set $I$. Set $p_i = \tau(\{\nu_i\}) > 0$. Since the $\nu_i$'s are countable and mutually singular, we can find a measurable partition $\xi = \{X_1, X_2, \ldots\}$ of $X$ such that $\nu_i(X_i) = 1$ for each $i \in I$.

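If $\beta$ is any countable measurable partition of $X$ then

$$H_\mu(\beta / \xi) = \sum_{i \in I} \sum_{B \in \beta} -\mu(B \cap X_i) \cdot \log\left( \frac{\mu(B \cap X_i)}{\mu(X_i)} \right) = \sum_{i \in I} \sum_{B \in \beta} -p_i \cdot \nu_i(B) \cdot \log(\nu_i(B)) = \sum_{i \in I} p_i \cdot H_{\nu_i}(\beta).$$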




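In particular, if $\beta$ refines $\xi$ we have

$$H_\mu(\beta) = H_\mu(\xi) + H_\mu(\beta / \xi) = H(\tau) + \sum_{i \in I} p_i \cdot H_{\nu_i}(\beta),$$

and therefore (assuming $H(\beta) < \infty$)

$$\begin{aligned}
F_G(X, \mu, S, \beta) &= (1 - 2r) H_\mu(\beta) + \sum_{s \in S} H_\mu(s \cdot \beta \vee \beta) \\
&= (1 - r) H(\tau) + \sum_{i \in I} p_i \cdot \left( (1 - 2r) H_{\nu_i}(\beta) + \sum_{s \in S} H_{\nu_i}(s \cdot \beta \vee \beta) \right) \\
&= (1 - r) H(\tau) + \sum_{i \in I} p_i \cdot F_G(X, \nu_i, S, \beta).
\end{aligned}$$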

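Let $\alpha$ be a generating partition for $G \curvearrowright (X, \mu)$ with $H_\mu(\alpha) < \infty$. As $\alpha \vee \xi$ is generating and $H_\mu(\alpha \vee \xi) \le H_\mu(\xi) + H_\mu(\alpha) = H(\tau) + H_\mu(\alpha) < \infty$, we can apply Lemma 4.2 to obtain

$$\begin{aligned}
f_G(X, \mu) &= \lim_{n \to \infty} F_G(X, \mu, S, B_n \cdot (\alpha \vee \xi)) \\
&= \lim_{n \to \infty} \left( (1 - r) H(\tau) + \sum_{i \in I} p_i \cdot F_G(X, \nu_i, S, B_n \cdot (\alpha \vee \xi)) \right) \\
&= (1 - r) H(\tau) + \sum_{i \in I} p_i \cdot f_G(X, \nu_i) = \int f_G(X, \nu)\, d\tau - (r - 1) H(\tau).
\end{aligned}$$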



This formula holds whenever $\tau$ is a countable decomposition of $\mu$ into mutually singular measures with $H(\tau) < \infty$.
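The entropy identity behind the special case, $H_\mu(\beta) = H(\tau) + \sum_i p_i H_{\nu_i}(\beta)$ for $\beta$ refining the partition into supports, can be sanity-checked on a finite example. The sketch below is illustrative only: the six-point space, the two measures, and the weights are hypothetical, and $\beta$ is taken to be the partition into points.

```python
import math

def entropy(probs):
    """Shannon entropy (natural log) of a probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0)

# Hypothetical toy space X = {0,...,5}, split into X_1 = {0,1,2} and X_2 = {3,4,5}.
# nu[0] lives on X_1 and nu[1] lives on X_2, so they are mutually singular.
p = [0.4, 0.6]
nu = [[0.2, 0.3, 0.5, 0.0, 0.0, 0.0],
      [0.0, 0.0, 0.0, 0.1, 0.1, 0.8]]
mu = [sum(p_i * nu_i[x] for p_i, nu_i in zip(p, nu)) for x in range(6)]

# beta is the partition of X into points, which refines {X_1, X_2}.
lhs = entropy(mu)                                           # H_mu(beta)
rhs = entropy(p) + sum(p_i * entropy(nu_i) for p_i, nu_i in zip(p, nu))

def tau_coefficient(rank):
    """Coefficient of H(tau) in F_G: (1 - 2r) from H(beta) plus r from the joins."""
    return (1 - 2 * rank) + rank

print(abs(lhs - rhs) < 1e-12, tau_coefficient(2))
```

The helper `tau_coefficient` only records the bookkeeping that turns $(1 - 2r)H(\tau) + rH(\tau)$ into $(1 - r)H(\tau)$.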

Now let $\tau$ be the ergodic decomposition of $\mu$. If $H(\tau) < \infty$ then $\tau$ is countable and by the previous paragraph

$$f_G(X, \mu) = \int f_G(X, \nu)\, d\tau - (r - 1) H(\tau).$$

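Now suppose that $H(\tau) = \infty$. Then there is a sequence $(\Lambda_n)_{n \in \mathbb{N}}$ of finite measurable partitions of $\mathfrak{M}(X)$ with $H_\tau(\Lambda_n)$ tending to infinity as $n$ tends to infinity. Fix $n \in \mathbb{N}$ and for $\lambda \in \Lambda_n$ define

$$m_\lambda = \frac{1}{\tau(\lambda)} \int_{\nu \in \lambda} \nu\, d\tau.$$

Let $\zeta$ be the probability measure on $\mathfrak{M}(X)$ which has atoms $\{m_\lambda : \lambda \in \Lambda_n\}$ and satisfies $\zeta(\{m_\lambda\}) = \tau(\lambda)$. Then $\zeta$ is a decomposition of $\mu$ into mutually singular measures and $H(\zeta) = H_\tau(\Lambda_n) < \infty$. By the paragraph above we have

$$\begin{aligned}
f_G(X, \mu) &= \int f_G(X, m)\, d\zeta - (r - 1) \cdot H(\zeta) \le \int H_m(\alpha)\, d\zeta - (r - 1) H(\zeta) \\
&\le H_\mu(\alpha) - (r - 1) H(\zeta) = H_\mu(\alpha) - (r - 1) \cdot H_\tau(\Lambda_n).
\end{aligned}$$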
Taking the limit as $n$ tends to infinity we obtain $f_G(X, \mu) = -\infty$. We also have

$$\int f_G(X, \nu)\, d\tau - (r - 1) H(\tau) \le \int H_\nu(\alpha)\, d\tau - (r - 1) H(\tau) \le H_\mu(\alpha) - (r - 1) H(\tau) = -\infty,$$

provided the function $\nu \mapsto f_G(X, \nu)$ is Borel, so that the integral is defined (this is only a concern now because $\tau$ may not be countable). We will resolve this technicality in the next paragraph and thus we will have

$$f_G(X, \mu) = \int_{\nu \in \mathfrak{E}(X)} f_G(X, \nu)\, d\tau - (r - 1) H(\tau)$$

in all cases.

As we saw in the proof of Lemma 4.2, the map $\nu \mapsto H_\nu(\beta)$ is Borel for every countable measurable partition $\beta$ of $X$. When $H_\nu(\beta) < \infty$ we have that $F_G(X, \nu, S, \beta)$ is defined. By Lemma 4.2 there is a Borel set $E \subseteq \mathfrak{E}(X)$ such that $\tau(E) = 1$ and $H_\nu(B_n \cdot \alpha) \le |B_n| \cdot H_\nu(\alpha) < \infty$ for all $\nu \in E$ and all $n \in \mathbb{N}$. It readily follows that $\nu \mapsto F_G(X, \nu, S, B_n \cdot \alpha)$ is a Borel function on $E$. After taking limits, we find that $\nu \mapsto f_G(X, \nu)$ is a Borel function on $E$. This completes the proof. □

Since $H(\tau) < \infty$ implies that $\tau$ is purely atomic, the above proof demonstrates the following.

Corollary 4.4. Let $G$ be a finitely generated non-cyclic free group acting on a probability space $(X, \mu)$. Assume that $f_G(X, \mu)$ is defined. If $f_G(X, \mu) \neq -\infty$ then the action only has countably many ergodic components.

5. Proof of the Main Theorem 

In this section we prove our main theorem which characterizes the stabilizers 
which can appear in factors of actions having finite f-invariant entropy. 

If $(X, \mu)$ is a probability space and $B \subseteq X$ is Borel with $\mu(B) > 0$, then we define a probability measure $\mu_B$ on $X$ by

$$\mu_B(A) = \frac{\mu(A \cap B)}{\mu(B)}$$

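for Borel sets $A \subseteq X$. Notice that if $\alpha$, $\beta$, and $\xi$ are countable measurable partitions of $(X, \mu)$ then

$$H_\mu(\alpha / \beta \vee \xi) = \sum_{B \in \beta} \sum_{C \in \xi} \sum_{A \in \alpha} -\mu(A \cap B \cap C) \cdot \log\left( \frac{\mu(A \cap B \cap C)}{\mu(B \cap C)} \right) = \sum_{B \in \beta} \mu(B) \cdot H_{\mu_B}(\alpha / \xi).$$

Also notice that if $\nu = \mu_B$ then $\nu_C = \mu_{B \cap C}$.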

Lemma 5.1. Let $(X, \mu)$ be a probability space. If $\delta > 0$ then there is $\rho > 0$ such that if two countable measurable partitions $\alpha$, $\beta$ of $X$ satisfy $H_\mu(\alpha / \beta) < \rho$ then there is a subcollection $\beta' \subseteq \beta$ consisting of positive measure sets such that $\mu(\cup \beta') > 1 - \delta$ and for every $B \in \beta'$ there is $A \in \alpha$ with $\mu_B(A) > 1 - \delta$.

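Proof. By making $\delta$ smaller if necessary, we may suppose that $\delta < \frac{1}{3}$. Set

$$\epsilon = -\delta \cdot \log(\delta) - (1 - \delta) \cdot \log(1 - \delta) > 0$$

and set $\rho = \epsilon \cdot \delta$. Let $\alpha$ and $\beta$ be countable measurable partitions of $X$ satisfying $H_\mu(\alpha / \beta) < \rho$. Define

$$\beta' = \{B \in \beta : \mu(B) > 0 \text{ and } H_{\mu_B}(\alpha) < \epsilon\}.$$

Then

$$\epsilon \cdot \mu(X \setminus \cup \beta') \le \sum_{B \in \beta} \mu(B) \cdot H_{\mu_B}(\alpha) = H_\mu(\alpha / \beta) < \rho.$$

So $\mu(X \setminus \cup \beta') < \rho / \epsilon = \delta$. Thus $\mu(\cup \beta') > 1 - \delta$ as required.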

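Fix $B \in \beta'$. We must show that there is $A \in \alpha$ with $\mu_B(A) > 1 - \delta$. If $A \in \alpha$ satisfies

$$\delta \le \mu_B(A) \le 1 - \delta,$$

then the partition $\xi = \{A, X \setminus A\}$ is a coarsening of $\alpha$ and we have

$$-\delta \cdot \log(\delta) - (1 - \delta) \cdot \log(1 - \delta) \le H_{\mu_B}(\xi) \le H_{\mu_B}(\alpha) < \epsilon,$$

contradicting the definition of $\epsilon$. So for every $A \in \alpha$, we have that $\mu_B(A)$ is either less than $\delta$ or greater than $1 - \delta$. Towards a contradiction, suppose that $\mu_B(A) < \delta$ for all $A \in \alpha$. Then we can find a set $C$ which is a union of members of $\alpha$ such that $\mu_B(C) \in \left[ \frac{1}{2} - \frac{\delta}{2}, \frac{1}{2} + \frac{\delta}{2} \right]$. Then the partition $\gamma = \{C, X \setminus C\}$ is a coarsening of $\alpha$ so

$$-\left( \tfrac{1}{2} - \tfrac{\delta}{2} \right) \cdot \log\left( \tfrac{1}{2} - \tfrac{\delta}{2} \right) - \left( \tfrac{1}{2} + \tfrac{\delta}{2} \right) \cdot \log\left( \tfrac{1}{2} + \tfrac{\delta}{2} \right) \le H_{\mu_B}(\gamma) \le H_{\mu_B}(\alpha) < \epsilon,$$

contradicting the fact that $\delta < \frac{1}{3}$ (which gives $\delta < \frac{1}{2} - \frac{\delta}{2} \le \frac{1}{2}$, a range on which the left-hand side exceeds $\epsilon$). We conclude there is $A \in \alpha$ with $\mu_B(A) > 1 - \delta$. □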
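The constants in this proof are explicit, so they can be tabulated. The sketch below is an illustration rather than part of the argument: the function names are hypothetical, and the threshold $\delta < 1/3$ is used because it is exactly the range in which $\delta < \frac{1}{2} - \frac{\delta}{2}$, so that the binary entropy at $\frac{1}{2} - \frac{\delta}{2}$ exceeds the binary entropy at $\delta$.

```python
import math

def binary_entropy(x):
    """Entropy (natural log) of the two-class partition with masses x and 1 - x."""
    if x in (0.0, 1.0):
        return 0.0
    return -x * math.log(x) - (1 - x) * math.log(1 - x)

def lemma_constants(delta):
    """Return (epsilon, rho) as chosen in the proof, for 0 < delta < 1/3."""
    if not 0 < delta < 1 / 3:
        raise ValueError("this sketch assumes 0 < delta < 1/3")
    epsilon = binary_entropy(delta)
    rho = epsilon * delta
    return epsilon, rho

for delta in (0.01, 0.1, 0.3):
    epsilon, rho = lemma_constants(delta)
    # A class C with mass in [1/2 - delta/2, 1/2 + delta/2] has entropy at least
    # binary_entropy(1/2 - delta/2), which is larger than epsilon.
    print(delta, epsilon, rho, binary_entropy(0.5 - delta / 2) > epsilon)
```

Each printed row ends with `True`, which is the strict inequality that produces the final contradiction in the proof.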

Lemma 5.2. Let $G$ be a finitely generated free group acting on a probability space $(X, \mu)$. Let $S$ be a free generating set for $G$ and let $|g|$ denote the reduced $S$-word length of $g \in G$. Suppose that $\mu$-almost every $x \in X$ has non-trivial stabilizer. Then for every $\epsilon > 0$ there exists $1_G \neq g \in G$ for which

$$\mu(\{x \in X : g \cdot x \in B_{|g| - 1} \cdot x\}) > 1 - \epsilon.$$

Proof. Fix a total ordering of $S \cup S^{-1}$ and let $\preceq$ be the induced well ordering of $G$ (see the paragraph just before Theorem 1.5). For $x \in X$ we define $\phi(x)$ to be the $\preceq$-least $1_G \neq g \in G$ with $g \cdot x = x$. If $x$ has trivial stabilizer then we define $\phi(x) = 1_G$. Then $\phi : X \to G$ is a measurable function satisfying $\mu(\phi^{-1}(1_G)) = 0$ and $\phi(x) \cdot x = x$ for every $x \in X$.

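Fix $\epsilon > 0$ and set $\delta = \frac{\epsilon}{2}$. Let $k \in \mathbb{N}$ be such that $\mu(\phi^{-1}(B_k)) > 1 - \delta$. Let $M \in \mathbb{N}$ be such that

$$\left( 1 - \frac{1}{|B_k|} \right)^M < \delta.$$

Set $Y_0 = \emptyset$. We will inductively define $g_j \in G$ and $Y_j \subseteq X$ for $0 < j \le M$ which satisfy the following two conditions for every $j > 0$:

$$g_j \cdot y \in B_{|g_j| - 1} \cdot y \text{ for every } y \in Y_j;$$

$$\mu(Y_j) \ge \left( 1 - \frac{1}{|B_k|} \right) \cdot \mu(Y_{j-1}) + \frac{1}{|B_k|} \cdot (1 - \delta).$$

We first define $g_1$ and $Y_1$. Since $\mu(\phi^{-1}(B_k)) > 1 - \delta$, we can pick $1_G \neq g_1 \in B_k$ so that $Y_1 = \phi^{-1}(g_1)$ satisfies

$$\mu(Y_1) \ge \frac{1}{|B_k|} \cdot (1 - \delta).$$

The two properties above are then satisfied since $g_1$ stabilizes every point in $Y_1$.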
Now suppose that $1 < m \le M$ and $g_{m-1}$ and $Y_{m-1}$ have been defined and satisfy the two conditions above. Fix any $t \in G$ with $|t| > k$ such that $|t g_{m-1}| = |t| + |g_{m-1}|$. Since $\mu(\phi^{-1}(B_k)) > 1 - \delta$, we have that

$$\mu(\phi^{-1}(B_k) \setminus t g_{m-1} \cdot Y_{m-1}) > 1 - \delta - \mu(Y_{m-1}).$$

So we can find $1_G \neq u \in B_k$ such that $Z = \phi^{-1}(u)$ satisfies

$$\mu(Z \setminus t g_{m-1} \cdot Y_{m-1}) \ge \frac{1}{|B_k|} \cdot (1 - \delta - \mu(Y_{m-1})).$$

For $h \in G$ let $W_S(h)$ denote the reduced $S$-word representation of $h$. If $W_S(t)$ begins (on the left) with $W_S(ut)$ then $W_S(t^{-1} u^{-1} t)$ and $W_S(t)$ end with the same letter (note that $t^{-1} u^{-1} t \neq 1_G$). In this case we set $p = t^{-1} u^{-1} t$ and observe that

$$|p g_{m-1}| = |p| + |g_{m-1}|.$$

If $W_S(t)$ does not begin with $W_S(ut)$ then $W_S(t^{-1} u t)$ and $W_S(ut)$ end with the same letter. Furthermore, from $|u| < |t|$ we find that $W_S(ut)$ and $W_S(t)$ end with the same letter. So in this case we set $p = t^{-1} u t$ and again observe that

$$|p g_{m-1}| = |p| + |g_{m-1}|.$$
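The two-case choice of $p$ is a purely combinatorial statement about reduced words, so it can be checked by brute force in a small free group. The sketch below is an illustration under stated assumptions: it works in the rank 2 free group, encodes the letters $a, a^{-1}, b, b^{-1}$ as `1, -1, 2, -2`, only examines words of length at most 3, and requires $0 < |u| < |t|$ as used in the proof. All function names are hypothetical.

```python
LETTERS = (1, -1, 2, -2)  # a, a^-1, b, b^-1 in the rank 2 free group

def multiply(x, y):
    """Product of two reduced words (tuples of letters), freely reduced."""
    out = list(x)
    for letter in y:
        if out and out[-1] == -letter:
            out.pop()          # cancel a letter against its inverse
        else:
            out.append(letter)
    return tuple(out)

def inverse(x):
    """Inverse of a reduced word."""
    return tuple(-letter for letter in reversed(x))

def reduced_words(max_len):
    """All reduced words of length at most max_len, including the empty word."""
    words, frontier = [()], [()]
    for _ in range(max_len):
        frontier = [w + (l,) for w in frontier for l in LETTERS
                    if not w or w[-1] != -l]
        words.extend(frontier)
    return words

def choose_p(t, u):
    """Pick p = t^-1 u^-1 t if W(t) begins with W(ut), and p = t^-1 u t otherwise."""
    ut = multiply(u, t)
    if t[:len(ut)] == ut:
        return multiply(multiply(inverse(t), inverse(u)), t)
    return multiply(multiply(inverse(t), u), t)

def check(max_len=3):
    """Verify |pg| = |p| + |g| whenever |tg| = |t| + |g| and 0 < |u| < |t|."""
    words = reduced_words(max_len)
    checked = 0
    for t in words:
        for u in words:
            if not 0 < len(u) < len(t):
                continue
            p = choose_p(t, u)
            for g in words:
                if len(multiply(t, g)) != len(t) + len(g):
                    continue
                assert p != ()
                assert len(multiply(p, g)) == len(p) + len(g)
                checked += 1
    return checked

print(check(3) > 0)
```

The check passes because in both cases $p$ is non-trivial and ends with the last letter of $t$, so no cancellation occurs between $p$ and $g_{m-1}$.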

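We set $g_m = p g_{m-1}$ and $Y_m = Y_{m-1} \cup (t g_{m-1})^{-1} \cdot Z$. We now check that the two inductive hypotheses are satisfied. Fix $y \in Y_m$. First suppose that $y \in Y_{m-1}$. Then

$$g_m \cdot y = p g_{m-1} \cdot y \in p B_{|g_{m-1}| - 1} \cdot y \subseteq B_{|p| + |g_{m-1}| - 1} \cdot y = B_{|g_m| - 1} \cdot y.$$

Now suppose that $y \in Y_m \setminus Y_{m-1}$. Set $z = t g_{m-1} \cdot y$. Then $z \in Z = \phi^{-1}(u)$. It follows that

$$\begin{aligned}
g_m \cdot y = p g_{m-1} \cdot y = t^{-1} u^{\pm 1} t g_{m-1} \cdot y &= t^{-1} u^{\pm 1} \cdot z \\
&= t^{-1} \cdot z = g_{m-1} \cdot y \in B_{|g_{m-1}|} \cdot y \subseteq B_{|g_m| - 1} \cdot y.
\end{aligned}$$

So the first condition is satisfied. For the second, we have

$$\begin{aligned}
\mu(Y_m) &= \mu(Y_{m-1}) + \mu(Y_m \setminus Y_{m-1}) = \mu(Y_{m-1}) + \mu(Z \setminus t g_{m-1} \cdot Y_{m-1}) \\
&\ge \mu(Y_{m-1}) + \frac{1}{|B_k|} \cdot (1 - \delta - \mu(Y_{m-1})) = \left( 1 - \frac{1}{|B_k|} \right) \mu(Y_{m-1}) + \frac{1}{|B_k|} \cdot (1 - \delta).
\end{aligned}$$

This completes the inductive construction.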

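By our condition on the measures of the $Y_j$'s we have

$$\begin{aligned}
\mu(Y_M) &\ge \frac{1}{|B_k|} \cdot (1 - \delta) \cdot \sum_{j=0}^{M-1} \left( 1 - \frac{1}{|B_k|} \right)^j \\
&= (1 - \delta) \cdot \left( 1 - \left( 1 - \frac{1}{|B_k|} \right)^M \right) > (1 - \delta) \cdot (1 - \delta) > 1 - 2\delta = 1 - \epsilon.
\end{aligned}$$

Using $g = g_M$ completes the proof. □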
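The measure bound at the end of this proof comes from iterating a linear recurrence, which is easy to reproduce numerically. The sketch below makes several assumptions that are not taken from the paper: it treats the inductive inequality as an equality (the worst case), it picks $\delta = \epsilon/2$, it takes $M$ to be the least integer with $(1 - 1/|B_k|)^M < \delta$, and the helper names are hypothetical. The formula for $|B_k|$ counts reduced words of length at most $k$ in a free group of the given rank.

```python
def ball_size(rank, radius):
    """Number of reduced words of length <= radius in a free group of given rank."""
    size = 1
    sphere = 2 * rank            # number of reduced words of length 1
    for _ in range(radius):
        size += sphere
        sphere *= 2 * rank - 1   # each word extends by any letter but one
    return size

def steps_needed(ball, delta):
    """Least M with (1 - 1/ball)^M < delta."""
    m = 1
    while (1 - 1 / ball) ** m >= delta:
        m += 1
    return m

def iterate_lower_bound(ball, delta, steps):
    """Iterate y_m = (1 - 1/ball) * y_{m-1} + (1 - delta)/ball starting at y_0 = 0."""
    y = 0.0
    for _ in range(steps):
        y = (1 - 1 / ball) * y + (1 - delta) / ball
    return y

epsilon = 0.1
delta = epsilon / 2
ball = ball_size(rank=2, radius=2)
M = steps_needed(ball, delta)
bound = iterate_lower_bound(ball, delta, M)
closed_form = (1 - delta) * (1 - (1 - 1 / ball) ** M)
print(ball, M, bound > 1 - epsilon)
```

The iterated value agrees with the geometric-series closed form up to rounding, and it exceeds $1 - \epsilon$ once $M$ is large enough.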
We now prove a weakened version of the main theorem. 

Proposition 5.3. Let $G$ be a finitely generated non-cyclic free group acting on a probability space $(X, \mu)$. Assume that $f_G(X, \mu)$ is defined. If $G \curvearrowright (X, \mu)$ is ergodic and $f_G(X, \mu) \neq -\infty$ then either the action is essentially free or else $\mu$ is purely atomic.

Proof. Let $\alpha$ be a generating partition with $H(\alpha) < \infty$. Let $S$ be a free generating set for $G$, let $r = |S|$ be the rank of $G$, and let $|g|$ denote the reduced $S$-word length of $g \in G$. Also fix a total ordering of $S \cup S^{-1}$, let $\preceq$ be the induced well ordering of $G$ (see the paragraph before Theorem 1.5), and let $P(g)$ denote the set of group elements strictly preceding $g$. Assume that $G \curvearrowright (X, \mu)$ is ergodic and not essentially free and that $\mu$ is not purely atomic. We will show that $f_G(X, \mu) = -\infty$.

If $H(B_n \cdot \alpha / B_{n-1} \cdot \alpha)$ does not tend to infinity then $f_G(X, \mu) = -\infty$ by Corollary 3.3 and we are done. So suppose that $H(B_n \cdot \alpha / B_{n-1} \cdot \alpha)$ tends to infinity. Fix $0 < \epsilon < 1/2$ and let $n \in \mathbb{N}$ be such that

$$H(B_n \cdot \alpha / B_{n-1} \cdot \alpha) > \frac{1}{\epsilon}.$$

Set $\delta = \epsilon / (6 \cdot |S_n|)$. By combining finitely many classes of $\alpha$ into a single class, we can obtain a (possibly trivial) partition $\beta$ with $H(\beta) < \delta$. Say $A_1, A_2, \ldots, A_M \in \alpha$ were combined into a single class to form $\beta$. Set $B = A_1 \cup A_2 \cup \cdots \cup A_M$ so that $\{B\} = \beta \setminus \alpha$. Let $\alpha_M = \{A'_1, A'_2, \ldots, A'_M\}$ be any measurable partition of $X$ with $A_i \subseteq A'_i$ for each $i$. Notice that for any probability measure $\nu$ on $X$ (not necessarily $G$-invariant) and any $h \in G$ we have

$$\begin{aligned}
H_\nu(h \cdot \alpha / h \cdot \beta) &= \nu(h \cdot B) \cdot H_{\nu_{h \cdot B}}(h \cdot \alpha) = \nu(h \cdot B) \cdot H_{\nu_{h \cdot B}}(h \cdot \alpha_M) \\
&\le \nu(h \cdot B) \cdot \log(M) \le \log(M).
\end{aligned}$$

Since the action is ergodic and not essentially free, the stabilizer of $\mu$-almost every $x \in X$ is non-trivial. By the previous lemma, there is $1_G \neq g \in G$ such that

$$\mu(\{x \in X : g \cdot x \in B_{|g| - 1} \cdot x\}) > 1 - \delta / \log(M).$$

For $x \in X$, define $\psi(x)$ to be the $\preceq$-least element of $G$ satisfying $g \cdot x = \psi(x) \cdot x$. Note that $\mu(\psi^{-1}(g)) < \delta / \log(M)$. Since $\psi$ is measurable and its image is finite, it induces a finite measurable partition, $\xi$, of $X$. Let $0 < \kappa < \delta / \log(M)$ be such that

$$-\kappa \cdot \log(\kappa) - (1 - \kappa) \cdot \log(1 - \kappa) < \delta.$$

By Lemma 5.1 there exists $\rho > 0$ such that if $\chi$ is a countable measurable partition of $X$ satisfying $H(\xi / \chi) < \rho$, then there is a subcollection $\mathcal{C} \subseteq \chi$ consisting of positive measure sets such that $\mu(\cup \mathcal{C}) > 1 - \kappa$ and for every $C \in \mathcal{C}$ there is $E \in \xi$ with $\mu_C(E) > 1 - \kappa$. Let $N > n + |g|$ be such that

$$H(\xi / B_{N-1} \cdot \alpha) < \rho.$$

Such an $N$ exists since $\alpha$ is a generating partition.

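Fix $w \in S_n$. Let $F(w)$ be the set of $w \neq f \in G$ for which $|f| = |f w^{-1}| + |w|$. We claim that

$$\sum_{f \in F(w)} \delta(f, S, \alpha) \ge H(w \cdot \alpha / P(w) \cdot \alpha) - \frac{\epsilon}{|S_n|}.$$

Let $t \in G$ be such that $g^{-1} t w \in S_N$ and $|g^{-1} t w| = |g^{-1}| + |t| + |w|$. Then $g^{-1} t w \in F(w)$. Let $p_0 = w, p_1, p_2, \ldots, p_k = g^{-1} t w$ be the sequence of vertices in the path from $w$ to $g^{-1} t w$ in the left $S$-Cayley graph of $G$. Then $p_i \in F(w)$ for $i > 0$. By Lemma 3.5

$$\sum_{f \in F(w)} \delta(f, S, \alpha) \ge \sum_{i=1}^{k} \delta(p_i, S, \alpha) = H(w \cdot \alpha / P(w) \cdot \alpha) - H(g^{-1} t w \cdot \alpha / P(g^{-1} t w) \cdot \alpha).$$

So it suffices to show that $H(g^{-1} t w \cdot \alpha / P(g^{-1} t w) \cdot \alpha) < \epsilon / |S_n|$.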
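We have $P(g^{-1} t w) \supseteq B_{N-1}$ so

$$H(\xi / P(g^{-1} t w) \cdot \alpha) \le H(\xi / B_{N-1} \cdot \alpha) < \rho.$$

Thus there is a collection $\mathcal{C} \subseteq P(g^{-1} t w) \cdot \alpha$ consisting of positive measure sets such that $\mu(\cup \mathcal{C}) > 1 - \kappa$ and for every $C \in \mathcal{C}$ there is $E \in \xi$ with $\mu_C(E) > 1 - \kappa$. Let $\mathcal{C}' \subseteq \mathcal{C}$ consist of those $C \in \mathcal{C}$ for which there is $E \in \xi$ with $E \neq \psi^{-1}(g)$ and $\mu_C(E) > 1 - \kappa$. Observe that

$$\mu(\cup \mathcal{C}') > 1 - \kappa - \frac{\delta / \log(M)}{1 - \kappa} > 1 - \kappa - 2\delta / \log(M)$$

since $\mu(\psi^{-1}(g)) < \delta / \log(M)$. Fix $C \in \mathcal{C}'$ and let $h \in B_{|g| - 1}$ be such that $\mu_C(\psi^{-1}(h)) > 1 - \kappa$. As $C \in P(g^{-1} t w) \cdot \alpha$ and $h^{-1} t w \in P(g^{-1} t w)$, there is $A \in \alpha$ with $C \subseteq h^{-1} t w \cdot A$. Since $g^{-1} h$ acts trivially on $\psi^{-1}(h) \cap C$, we have

$$g^{-1} t w \cdot A = g^{-1} h \cdot (h^{-1} t w \cdot A) \supseteq g^{-1} h \cdot (\psi^{-1}(h) \cap C) = \psi^{-1}(h) \cap C.$$

Therefore

$$\mu_C(g^{-1} t w \cdot A) \ge \mu_C(\psi^{-1}(h)) > 1 - \kappa.$$

So $g^{-1} t w \cdot A$ covers most of $C$. If $A \in \beta$ then $\mu_C(g^{-1} t w \cdot B) < \kappa$ and therefore

$$H_{\mu_C}(g^{-1} t w \cdot \alpha / g^{-1} t w \cdot \beta) \le \mu_C(g^{-1} t w \cdot B) \cdot \log(M) < \kappa \cdot \log(M).$$

On the other hand, if $A \notin \beta$ then $A = A_i \subseteq A'_i$ for some $i$, $A \subseteq B$, and

$$\mu_{C \cap g^{-1} t w \cdot B}(g^{-1} t w \cdot A'_i) \ge \mu_C(g^{-1} t w \cdot A) > 1 - \kappa.$$

Thus in this case

$$\begin{aligned}
H_{\mu_C}(g^{-1} t w \cdot \alpha / g^{-1} t w \cdot \beta) &= \mu_C(g^{-1} t w \cdot B) \cdot H_{\mu_{C \cap g^{-1} t w \cdot B}}(g^{-1} t w \cdot \alpha_M) \\
&\le H_{\mu_{C \cap g^{-1} t w \cdot B}}(g^{-1} t w \cdot \alpha_M) < -\kappa \cdot \log(\kappa) - (1 - \kappa) \cdot \log(1 - \kappa) + \kappa \cdot \log(M - 1).
\end{aligned}$$

So in either case we have

$$H_{\mu_C}(g^{-1} t w \cdot \alpha / g^{-1} t w \cdot \beta) < -\kappa \cdot \log(\kappa) - (1 - \kappa) \cdot \log(1 - \kappa) + \kappa \cdot \log(M) < 2\delta.$$

This holds for every $C \in \mathcal{C}'$. It follows that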

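$$\begin{aligned}
H(g^{-1} t w \cdot \alpha / P(g^{-1} t w) \cdot \alpha)
&= H(g^{-1} t w \cdot \beta / P(g^{-1} t w) \cdot \alpha) + H(g^{-1} t w \cdot \alpha / P(g^{-1} t w) \cdot \alpha \vee g^{-1} t w \cdot \beta) \\
&< \delta + H(g^{-1} t w \cdot \alpha / P(g^{-1} t w) \cdot \alpha \vee g^{-1} t w \cdot \beta) \\
&= \delta + \sum_{C \in P(g^{-1} t w) \cdot \alpha} \mu(C) \cdot H_{\mu_C}(g^{-1} t w \cdot \alpha / g^{-1} t w \cdot \beta) \\
&\le \delta + \mu(X \setminus \cup \mathcal{C}') \cdot \log(M) + \sum_{C \in \mathcal{C}'} \mu(C) \cdot H_{\mu_C}(g^{-1} t w \cdot \alpha / g^{-1} t w \cdot \beta) \\
&< \delta + (\kappa + 2\delta / \log(M)) \cdot \log(M) + 2\delta < 6\delta = \frac{\epsilon}{|S_n|}.
\end{aligned}$$

This justifies our claim.
Now we compute

$$\sum_{h \in G} \delta(h, S, \alpha) \ge \sum_{w \in S_n} \sum_{f \in F(w)} \delta(f, S, \alpha) \ge \sum_{w \in S_n} \left( H(w \cdot \alpha / P(w) \cdot \alpha) - \frac{\epsilon}{|S_n|} \right) = H(B_n \cdot \alpha / B_{n-1} \cdot \alpha) - \epsilon > \frac{1}{\epsilon} - \epsilon.$$

By letting $\epsilon$ tend to zero we find that the cumulative independence decay is infinite. Therefore $f_G(X, \mu) = -\infty$ by Theorem 3.5. □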
Theorem 5.4. Let $G$ be a finitely generated non-cyclic free group acting on a probability space $(X, \mu)$. Assume that $f_G(X, \mu)$ is defined. If $f_G(X, \mu) \neq -\infty$ and $(Y, \nu)$ is any factor of $(X, \mu)$, then for $\nu$-almost every $y \in Y$, the stabilizer of $y$ is either trivial or has finite index in $G$. Furthermore, $\nu$-almost every $y \in Y$ with non-trivial stabilizer is an atom, and thus there are essentially only countably many points with non-trivial stabilizer.

Proof. To be clear, when we say that $\nu$-almost every $y \in Y$ with non-trivial stabilizer is an atom, we mean that there is a Borel set $Y' \subseteq Y$ such that $\nu(Y') = 1$ and every $y \in Y'$ with non-trivial stabilizer is an atom.

Let $r > 1$ be the rank of $G$. First suppose that $G \curvearrowright (Y, \nu)$ is ergodic. If $\nu$ is purely atomic then by ergodicity there are only finitely many atoms and they lie in a single orbit. In particular their stabilizers have finite index in $G$. So suppose that $\nu$ is not purely atomic. By ergodicity $\nu$ has no atoms. Let $\alpha$ be a generating partition for $G \curvearrowright (X, \mu)$ with $H_\mu(\alpha) < \infty$. Let $n \in \mathbb{N}$ be such that

$$-(r - 1) \log(n) < f_G(X, \mu) - H_\mu(\alpha).$$

Since $\nu$ has no atoms, we can find a measurable partition $\beta$ of $Y$ which has precisely $n$ classes of positive measure. The sub-$\sigma$-algebra $G \cdot \beta$ gives rise to a factor map $(Y, \nu) \to (Y', \nu')$. Moreover, $\beta$ pushes forward to a partition $\beta'$ of $(Y', \nu')$, and $\beta'$ has $n$ classes of positive measure. Also $\beta'$ is a generating partition for $G \curvearrowright (Y', \nu')$ and thus $f_G(Y', \nu')$ is defined since $\beta'$ is finite. By Theorem 2.2

$$f_G(Y', \nu') \ge f_G(X, \mu) - H_\mu(\alpha) > -(r - 1) \log(n).$$

By Proposition 5.3 either $G \curvearrowright (Y', \nu')$ is essentially free or $\nu'$ is purely atomic. If $\nu'$ were purely atomic, then it would have to have at least $n$ atoms since each class of $\beta'$ has positive measure. Every atom would have the same measure by ergodicity, so by Lemma 2.3 we would have $f_G(Y', \nu') \le -(r - 1) \log(n)$, a contradiction. So $G \curvearrowright (Y', \nu')$ is essentially free. It follows that $G \curvearrowright (Y, \nu)$ must be essentially free as well.

Now consider the general case in which $G \curvearrowright (Y, \nu)$ is not ergodic. Let $\tau$ be the ergodic decomposition of $\mu$. By Corollary 4.4 $\tau$ is purely atomic. Let $E \subseteq \mathfrak{E}(X)$ be the set of atoms of $\tau$. By Theorem 4.3 $f_G(X, \lambda)$ is defined for every $\lambda \in E$ and

$$f_G(X, \mu) = \sum_{\lambda \in E} \tau(\{\lambda\}) \cdot f_G(X, \lambda) - (r - 1) H(\tau).$$

Since $f_G(X, \mu) \neq -\infty$, we must have $f_G(X, \lambda) \neq -\infty$ for every $\lambda \in E$. Each measure $\lambda \in E$ pushes forward to an ergodic measure $\lambda_*$ on $Y$, and we have

$$\nu = \sum_{\lambda \in E} \tau(\{\lambda\}) \cdot \lambda_*.$$

By the previous paragraph $\lambda_*$ is either purely atomic or else $G \curvearrowright (Y, \lambda_*)$ is essentially free. Of course, when $\lambda_*$ is purely atomic the stabilizer of $\lambda_*$-almost every point has finite index in $G$. So if we let $B$ be the set of all $y \in Y$ for which the stabilizer of $y$ is neither trivial nor has finite index in $G$, then $\lambda_*(B) = 0$ for every $\lambda \in E$. By the above decomposition of $\nu$, $\nu(B) = 0$ and thus the stabilizer of $\nu$-almost every $y \in Y$ is either trivial or has finite index in $G$. Now let $A \subseteq Y$ be the set of all atoms of $\nu$. So $y \in A$ if and only if $\nu(\{y\}) > 0$. Also let $C$ be the set of $y \in Y$ for which the stabilizer of $y$ is non-trivial. Then $\lambda_*(C \setminus A) = 0$ for every $\lambda \in E$ and thus $\nu(C \setminus A) = 0$. So setting $Y' = Y \setminus (C \setminus A)$, we have $\nu(Y') = 1$ and every $y \in Y'$ with non-trivial stabilizer lies in $Y' \cap C \subseteq A$ and is thus an atom of $\nu$. □

We now prove Corollary 1.2 from the introduction.

Corollary 5.5. Let $G$ be a finitely generated non-cyclic free group acting on a probability space $(X, \mu)$. Assume that $f_G(X, \mu)$ is defined. If $G \curvearrowright (X, \mu)$ is ergodic and $f_G(X, \mu) \neq -\infty$, then there is $n \in \mathbb{N}$ such that for every subgroup $\Gamma \le G$ containing a non-trivial normal subgroup of $G$ the number of ergodic components of $\Gamma \curvearrowright (X, \mu)$ is at most $n$.

Proof. Let $\alpha$ be a generating partition for $G \curvearrowright (X, \mu)$ with $H(\alpha) < \infty$. Let $n$ be maximal such that

$$-(r - 1) \log(n) \ge f_G(X, \mu) - H(\alpha).$$

Such an $n$ exists since we are assuming $r > 1$. Let $\Gamma \le G$ be a subgroup containing a non-trivial normal subgroup of $G$. Say $K \neq \{1_G\}$ is normal in $G$ and $K \le \Gamma$. Let $\tau_K$ be the ergodic decomposition of $K \curvearrowright (X, \mu)$. Then $G$ acts on $(\mathfrak{M}_K(X), \tau_K)$ and this action is a factor of $G \curvearrowright (X, \mu)$ (the factor map is induced by the sub-$\sigma$-algebra of all $K$-invariant Borel sets). For $\tau_K$-almost every $\lambda \in \mathfrak{M}_K(X)$ the $G$-stabilizer of $\lambda$ contains $K$ and is thus non-trivial. So Theorem 5.4 implies that $\tau_K$ is purely atomic. By ergodicity there are finitely many atoms, each with the same measure. So if there are $m$ atoms of $\tau_K$ then by Theorem 2.2 and Lemma 2.3

$$-(r - 1) \log(m) = f_G(\mathfrak{M}_K(X), \tau_K) \ge f_G(X, \mu) - H(\alpha).$$

Therefore $m$, the number of ergodic components of $K \curvearrowright (X, \mu)$, is at most $n$. Since $K \le \Gamma$, every ergodic component of $\Gamma \curvearrowright (X, \mu)$ contains at least one $K$-ergodic component. Therefore the number of $\Gamma$-ergodic components is at most $n$. □
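The integer $n$ in this proof is determined by the rank $r$ and the gap $f_G(X, \mu) - H(\alpha)$, so it can be computed directly once those numbers are known. The sketch below is an illustration with hypothetical inputs: the function name `max_components` and the sample values are assumptions, and the function simply searches for the largest $n$ with $-(r - 1)\log(n) \ge f_G(X, \mu) - H(\alpha)$.

```python
import math

def max_components(f_value, h_alpha, rank):
    """Largest n with -(rank - 1) * log(n) >= f_value - h_alpha."""
    if rank < 2:
        raise ValueError("the bound is only meaningful for rank at least 2")
    gap = f_value - h_alpha
    if math.isinf(gap) or math.isnan(gap):
        raise ValueError("f_value and h_alpha must be finite")
    if gap > 0:
        raise ValueError("expected f_value <= h_alpha")
    n = 1
    # Increase n while the next integer still satisfies the inequality.
    while -(rank - 1) * math.log(n + 1) >= gap:
        n += 1
    return n

# Hypothetical inputs: rank 2 and a gap of -log(4.5) allow at most 4 components.
print(max_components(-math.log(4.5), 0.0, 2))
```

Equivalently, $n$ is the integer part of $\exp\big((H(\alpha) - f_G(X, \mu))/(r - 1)\big)$; the loop avoids rounding issues when that quantity is close to an integer.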

The above corollary cannot be extended to hold for all non-trivial subgroups 
of G. This is demonstrated by the following example provided by Lewis Bowen 
(private communication).

Proposition 5.6 (Lewis Bowen). Let $G$ be a finitely generated non-cyclic free group. Then there exists an ergodic action $G \curvearrowright (X, \mu)$ and a non-trivial subgroup $\Gamma \le G$ such that $f_G(X, \mu)$ is defined and finite (i.e. $f_G(X, \mu) \neq -\infty$), but there are infinitely many ergodic components of $\Gamma \curvearrowright (X, \mu)$.

Proof. Let $S$ be a free generating set for $G$. Also let $\preceq$ be a well ordering of $G$ as described in the paragraph above Theorem 1.5 and let $P(g)$ be the set of group elements strictly preceding $g$. Fix $t \in S$ and set $\Gamma = \langle t \rangle$. Let $G / \Gamma = \{g\Gamma : g \in G\}$ be the set of all left cosets of $\Gamma$.

Let $\nu$ be the probability measure on the positive integers, $\mathbb{N}_+$, defined by $\nu(\{n\}) = 2^{-n}$. It is easily computed that $H(\nu) = 2 \cdot \log(2) < \infty$. Let $X$ be the set of all functions from $G / \Gamma$ to $\mathbb{N}_+$. Equivalently,

$$X = \mathbb{N}_+^{G / \Gamma} = \prod_{g\Gamma \in G / \Gamma} \mathbb{N}_+.$$

We let $\mu = \nu^{G / \Gamma}$ be the product measure. We let $G$ act on $X$ by permuting coordinates on the left: $(h \cdot x)(g\Gamma) = x(h^{-1} g \Gamma)$. Since $G$ acts by permuting coordinates, it readily follows that $\mu$ is $G$-invariant. The action $G \curvearrowright (X, \mu)$ is quite similar to a Bernoulli shift. After a minor and obvious modification, the standard argument that Bernoulli shifts are ergodic shows that $G \curvearrowright (X, \mu)$ is ergodic as well.
Let $\alpha = \{A_i : i \in \mathbb{N}_+\}$ be the partition of $X$ given by $A_i = \{x \in X : x(\Gamma) = i\}$. Then $\alpha$ is a generating partition for $G \curvearrowright (X, \mu)$ and $H_\mu(\alpha) = H(\nu) < \infty$. So $f_G(X, \mu)$ is defined. Notice that $\gamma \cdot \alpha = \alpha$ for every $\gamma \in \Gamma$. Since values at distinct coordinates are independent, it is not difficult to check that $H(g \cdot \alpha / P(g) \cdot \alpha)$ is $0$ if $g\Gamma \subseteq P(g) \cdot \Gamma$ and is otherwise equal to $H(\alpha)$. Since $t \in S$ and $\Gamma = \langle t \rangle$, it follows that $H(g \cdot \alpha / P(g) \cdot \alpha)$ equals $0$ if the reduced $S$-word representation of $g$ ends with $t$ or $t^{-1}$ and is otherwise equal to $H(\alpha)$. So if $1_G \neq g \in G$ and $s \in S \cup S^{-1}$ satisfy $|s^{-1} g| = |g| - 1$, then $H(g \cdot \alpha / P(g) \cdot \alpha) = H(s^{-1} g \cdot \alpha / P(s^{-1} g) \cdot \alpha)$ unless $g \in \{t, t^{-1}\}$. It follows that $\delta(g, S, \alpha) = 0$ for $g \notin \{1_G, t, t^{-1}\}$ and $\delta(t, S, \alpha) = \delta(t^{-1}, S, \alpha) = H(\alpha)$. So $f_G(X, \mu) = 0$ by Theorem 3.5 (alternatively, one could observe that $G \curvearrowright (X, \mu)$ is measurably conjugate to a Markov process and apply Theorem 1.1). So $G \curvearrowright (X, \mu)$ is ergodic, $f_G(X, \mu)$ is defined, and $f_G(X, \mu) = 0 \neq -\infty$. Finally, we have $\gamma \cdot A = A$ for every $A \in \alpha$ and $\gamma \in \Gamma$, so there are at least as many ergodic components of $\Gamma \curvearrowright (X, \mu)$ as there are members of $\alpha$ with positive measure. We conclude that there are infinitely many ergodic components of $\Gamma \curvearrowright (X, \mu)$. □
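Two numerical facts used in this example are easy to confirm: the base measure has entropy $2 \log 2$, and the expression $(1 - 2r) H(\alpha) + \sum_{s \in S} H(s \cdot \alpha \vee \alpha)$ vanishes for the partition $\alpha$. The sketch below is illustrative and rests on stated assumptions: the series is truncated at 200 terms, the joins are evaluated using that $t \cdot \alpha = \alpha$ while $s \cdot \alpha$ and $\alpha$ are independent for the other generators, and the function names are hypothetical. It only evaluates the expression at $\alpha$ itself; identifying that value with the f-invariant entropy relies on the results cited in the proof.

```python
import math

def geometric_entropy(terms=200):
    """Partial sum of the entropy of the measure n -> 2**-n on positive integers."""
    total = 0.0
    for n in range(1, terms + 1):
        p = 2.0 ** -n
        total -= p * math.log(p)
    return total

def f_functional_for_alpha(rank, h_alpha):
    """(1 - 2r) H(alpha) + sum over generators s of H(s.alpha v alpha)."""
    join_with_t = h_alpha             # t fixes alpha, so the join is alpha itself
    join_with_other = 2 * h_alpha     # independent coordinates add their entropies
    return (1 - 2 * rank) * h_alpha + join_with_t + (rank - 1) * join_with_other

h = geometric_entropy()
print(abs(h - 2 * math.log(2)) < 1e-12)
print([abs(f_functional_for_alpha(r, h)) < 1e-12 for r in range(2, 6)])
```

The cancellation is exact algebraically, since $(1 - 2r) + 1 + 2(r - 1) = 0$ for every rank $r$.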